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PREFACE 


The celebrated physicist and mathematician A. M. Ampére coined the 
word cybernétique to mean the science of civil government (Part II of 
“ Essai sur la philosophie des sciences,” 1845, Paris). Ampére’s grandiose 
scheme of political sciences has not, and perhaps never will, come to 
fruition. In the meantime, conflict between governments with the use 
of force greatly accelerated the development of another branch of science, 
the science of control and guidance of mechanical and electrical systems. 
It is thus perhaps ironical that Ampére’s word should be borrowed by 
N. Wiener to name this new science, so important to modern warfare. 
The “cybernetics” of Wiener (“‘Cybernetics, or Control and Communi- 
cation in the Animal and the Machine,” John Wiley & Sons, Inc., New 
York, 1948) is the science of organization of mechanical and electrical 
components for stability and purposeful actions. A distinguishing fea- 
ture of this new science is the total absence of considerations of energy, 
heat, and efficiency, which are so important in other natural sciences. 
In fact, the primary concern of cybernetics is on the qualitative aspects 
of the interrelations among the various components of a system and the 
synthetic behavior of the complete mechanism. 

The purpose of “Engineering Cybernetics” is then to study those parts 
of the broad science of cybernetics which have direct engineering appli- 
cations in designing controlled or guided systems. It certainly includes 
such topics usually treated in books on servomechanisms. But a wider 
range of topics is only one difference between engineering cybernetics 
and servomechanisms engineering. A deeper—and thus more impor- 
tant—difference lies in the fact that engineering cybernetics is an engi- 
neering science, while servomechanisms engineering is an engineering 
practice. An engineering science aims to organize the design principles 
used in engineering practice into a discipline and thus to exhibit the 
similarities between different areas of engineering practice and to empha- 
size the power of fundamental concepts. In short, an engineering science 
is predominated by theoretical analysis and very often uses the tool of 
advanced mathematics. A glance at the contents of this book makes 
this quite evident. The detailed construction and design of the com- 
ponents of the system—the actual implementation of the theory—are 
almost never discussed. No gadget is mentioned. 
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What is the justification of this separation of the theory from the 
practice? With knowledge of the very existence of various engineering 
sciences and their recent rapid development, such justification seems 
hardly necessary. Moreover, a specific example could be cited: Fluid 
mechanics exists as an engineering science separate from the practice of 
aerodynamics engineers, hydraulic engineers, meteorologists, and many 
others who use the results of investigations in fluid mechanics in their 
daily work. In fact, without fluid mechanists, the understanding and 
the utilization of supersonic flows would certainly be greatly delayed, to 
say the least. Therefore, the justification of establishing engineering 
cybernetics as an engineering science lies in the possibility that looking 
at things in broad outline and in an organized way often leads to fruitful 
new avenues of approach to old problems and gives new, unexpected 
vistas. At the present stage of multifarious developments in control 
and guidance engineering, there is a very real advantage in trying to 
grasp the full potentialities of this new science by a comprehensive survey 
of the whole field. l 

Therefore a discussion on engineering cybernetics should cover reason- 
ably well all aspects of the science expected to have engineering appli- 
cations and, in particular, should not avoid a topic for the mere reason of 
mathematical difficulties. This is all the more true when one realizes 
that the mathematical difficulties of any subject are usually quite 
artificial. With a little reinterpretation, the matter could generally be 
brought down to the level of a research engineer. The mathematical 
level of this book is then that of a student who has had a course in ele- 
ments of mathematical analysis. Knowledge of complex integration, 
variational calculus, and ordinary differential equations forms the pre- 
requisite for the study. On the other hand, no rigorous and elegant 
mathematical argument is introduced if a heuristic discussion suffices. 
Hence to the practicing electronics specialist, the treatment here must 
appear to be excessively “long-hair,” but to a mathematician interested 
in this field, the treatment here may well appear to be amateurish. If 
indeed these are the only criticisms, then, with all due respect to them, 
the author shall feel that he has not failed in what he aimed to do. 

During the course of writing these chapters, the author had the benefit 
of many conversations with his colleagues at the California Institute of 
Technology, Dr. Frank E. Marble and Dr. Charles R. DePrima, which 
often led to sudden clarification of an obscure point. The task of pre- 
paring the manuscript was greatly lightened by the efficient help rendered 
by Sedat Serdengecti and Ruth L. Winkel. To all of them, the author 
wishes to extend his sincere thanks. 
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CHAPTER 1 


INTRODUCTION 


Consider a system of one degree of freedom so that the physical state 
of the system can be specified by a single variable y. The behavior of 
the system is then described by taking y as a function of time t. To 
determine this behavior or y(¢), it is necessary to know the structure 
of the system and the properties of the individual elements of the system. 
This knowledge about the system, together with fundamental physical 
laws, when translated into mathematical language gives an equation for 
calculating the function y(¢). This equation could be an integral equa- 
tion or an integrodifferential equation, but very often it is a differential 
equation. It is also an ordinary differential equation, because there is 
only one independent variable, the time t. 

A differential equation is called linear, and the system described by the 
differential equation, a linear system, if each term of the equation con- 
tains at most only first powers of the dependent variable y or its time 
derivatives. The terms should not contain higher powers of y or cross 
products of y and its derivatives. Otherwise, the differential equation 
is called nonlinear, and the system described by the differential equation, 
a nonlinear system. Linear systems can be further subdivided into 
systems with constant coefficients and systems with variable coefficients. 
Constant-coefficient systems have constants independent of time ¢ as 
coefficients of the terms in the differential equation describing the system. 
Variable-coefficient systems have coefficients that are functions of t. 

This concern about the classification of the differential equation has 
its justification in that the character of the solution of the equation 
and hence the behavior of the system depend closely on the type of the 
differential equation which describes it. Even more than this, the type 
of differential equation specifies the kind of questions that can logically 
be asked about the system. In other words, the type of differential 
equation determines the proper approach to the solution of the engineer- 
ing problem of the system. We shall see this presently. 

1.1 Linear Systems of Constant Coefficients. Let us consider the 
simplest system—a first-order system. That is, the differential equation 
of the system is a first-order linear differential equation of constant 


coefficients. If the system is assumed to be free and is not subjected to 
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‘forcing functions,” then the differential equation can be written as 
dy 
od = 1.1 


k may be called the spring constant and is real. When there is no varia- 
tion of y with respect to time, dy/dt vanishes, and Eq. (1.1) requires 
y =0. Therefore the stationary, or equilibrium, state of the system 
corresponds to y = 0. 

The solution of Eq. (1.1) is 


yY = yor ™ (1.2) 
where yo is the initial value of y, or 
y(0) = Yo (1.3) 


yy is thus the initial disturbance of the system from the equilibrium 
state. The behavior of the system for ¢ > 0 is illustrated in Fig. 1.1 
for both positive and negative k. Itis 


: seen that for k > 0 the magnitude of y 
<0 decreases with time. Then, as the 
j time increases indefinitely, y— 0. 
R>0 Therefore, for k > 0, the disturbance 


of the system will eventually disap- 
pear. The system can then be said to 
be stable. When k < 0, the motion of 
the system increases with time, and eventually the disturbance will 
become very large no matter how small the initial displacement is: the 
system will never return to the equilibrium state once disturbed. Such 
systems are thus unstable. 

For systems of higher order, the differential equation will have higher 
derivatives. The nth-order system has the differential equation 


Fig. 1.1 


dy qr- ty 
g T Oiga T o t ay =O (1.4) 


For a physical system, the coefficients an-ı, . . . , @o are real. Then the 
solution of Eq. (1.4) can be written as 


y= ) ype sin (Bt + g) (1.5) 
i=l 
where a; 8; are real and are related to the coefficients dnt, . . . , do, 
and the øs are the phase angles. The motion of the system is thus 
stable only if all «;’s are negative. If one of them is positive, the dis- 
turbance will eventually diverge, and the system is thus unstable. 
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From the above examples it is seen that the crucial question to ask 
about the behavior of a linear system of constant coefficients is the 
question of stability. Needless to say, the usual aim of an engineering 
design is stability. The question of stability can be answered, however, 
once the coefficients of the differential equation are specified. In the 
case of the simple first-order system specified by Eq. (1.1), the only 
information that matters is the sign of the coefficient k. 

1.2 Linear Systems of Variable Coefficients. If there is a variable 
parameter in the system under study, the stationary, or equilibrium, 
state of the system can be changed by changing this parameter. It is 
natural, then, to expect the coefficients of the linear differential equation 
describing the system to be also functions of this parameter. For 
instance, the aerodynamic forces acting on an aircraft are functions of 
the speed of the aircraft. If the speed of the aircraft is changing owing 
to acceleration or deceleration, the aerodynamic forces will change 
accordingly while the inertial properties of the aircraft remain practically 
the same. As a result, if we wish to calculate the disturbed motion of the 
aircraft from, say, horizontal fight, the fundamental differential equa- 
tion will be an equation with variable coefficients. 

Let us return to the simple example of a first-order system, described 
by Eq. (1.1). If the spring constant k is a function of the speed of the 
aircraft and if the aircraft has a constant acceleration a, then k isa 
function of the velocity u = at. Thus the differential equation can be 
written as 


oY + badly = 0 (1.6) 
The solution of this equation is 


at 
log = — 5 I k(£) dé (1.7) 


where yo is the initial disturbance. If kis always positive then log (y/yo) 
is always negative, and as time increases log (y/yo) will be increasingly 
negative. Therefore y is always less than yo and eventually will vanish. 
Thus the system is stable. If k is always negative, log (y/yo) will be 
increasingly positive with time. Then y will eventually become very 
large even if the initial disturbance yo is very small. The system is thus 
unstable. These characteristics of the linear system with variable 
coefficients that remain positive or negative are very similar to those of 
systems with constant coefficients. 

The interesting case is, however, when k has both positive and negative 
values. Let us assume k(at) to be first positive, then negative, but 
finally positive again. If the first zero of k is denoted by u: = atı 
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and the second zero denoted by uz = af, then according to our previous 
concepts, the system is unstable in the velocity range from u to uz (Fig. 
1.2). Let ymin be the minimum value of y and ymax be the maximum value 
of y. Then Eq. (1.7) gives 


Ymin I m 
a= — k 1.8 
jog lat E S O a (1.8) 
and 
Yms o Lf d 1.9 
log yh ah (£) dé (1.9) 


Of primary engineering interest is the question: How large is Ymx? 
Is it so large that the system cannot function properly? We note that 


Yo Ymax 
y 
Yimin 
atı ate k ( at) 
“Unstable” 
zone 
Fra. 1.2 


to answer this question the knowledge of two things, in addition to the 
functional dependence of k upon u, is required. These are: How large 
is the acceleration a? What is the magnitude of the initial disturbance 
yo? For any fixed a, Ymax is proportional to yo. But more important, 
for any fixed initial disturbance, the maximum value of the deviation 
Ymax can be greatly reduced by increasing the acceleration a, as shown by 
Eq. (1.9). This means that by going through the “unstable”? zone 
quickly, the undesirable effects can be minimized. 

Therefore, for the more general linear systems with variable coefficients, 
the simple question of stability has no definite meaning. The more 
meaningful question is to ask specifically whether under a definite cri- 
terion the system will behave satisfactorily with specified disturbances 
and circumstances. In our simple example of a first-order system, the 
definite criterion of proper behavior is Ymax; the specified disturbance is 
yo, and the specified circumstance is the acceleration a. Thus by going 
from systems with constant coefficients to systems with variable coeffi- 
cients, the character of the problem is already considerably modified. 
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1.3 Nonlinear Systems. If the spring constant k of the simple frst- 
order system described by Eq. (1.1) is a function of the disturbance y 
itself, then the differential equation is 


W + ily) =0 (1.10) 


where f(y) = k(y)y. It is seen that the differential equation is nonlinear. 
The system described by Eq. (1.10) is thus the simplest example of a 
nonlinear system. The solution y(t) can be computed from the following 
relation obtained by integrating Eq. (1.10): 


Y dy 
t= — == 1.11 
Yo j (n) ( ) 
where yo is again the initial disturbance. 
On the other hand, repeated differentiation of Eq. (1.10) gives 
dy , df dy _ 

di? dy dt 
ay 2 (wy 4 Lay (1.12) 

diè dy? \dt dy dt? 


e è> èe èo © èe e@ č s  e@  e@*  e@ ċē e \& ; b ò «« ç ‘œ 


Thus if yı is a zero of the function f(y) and if f(y) is regular at yı so that 
all the derivatives of f(y) with respect to y are finite at yı, then from 
Eqs. (1.10) and (1.12) 


i ae ae ee at y = yı (1.13) 
This means that y approaches yı asymptotically. In fact, if yo > yı and 
f(yo) > 0, then y will become yı eventually. If yo < yı, then f(yo) < 0, 
and y will again become yı at t— œ. This pattern of behavior of y 
is repeated with other zeros of the f(y) (Fig. 1.3). 

If the initial disturbance y; coincides with one of the zeros of f(y), 
this value of y will be maintained with increasing time. Thus the zeros 
of f(y) are equilibrium positions. If df/dy > 0 at a zero such as y, 
small deviations from this equilibrium position will eventually disappear 
and the system will finally return to the initial state. Thus the system 
may be said to have stability for small disturbances at yı. If, however, 
df/dy < 0 at a zero such as yo, the slightest disturbance from this equi- 
librium position will cause the system to move to the next equilibrium 
positions yi or ys. Y2 is thus an unstable equilibrium state. 

We have seen that even for the very simple nonlinear system described 
by Eq. (1.10), the behavior of the system is very complicated. The 
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system may have both stability and instability. Thus for such systems, 
it is entirely senseless to ask a general question about stability; rather, 
each specific problem must be considered individually. 


Fig. 1.3 


1.4 Engineering Approximation. It is almost certain that any physi- 
cal system, if analyzed in great detail, is always nonlinear. We speak 
of the system as linear only with the understanding that we mean the 
system can be approximated sufficiently accurately by a linear system. 
Furthermore, sufficient accuracy means that the deviation from linearity 
is so small as to be unimportant for the specific problem considered. 
Thus whether a system is linear or not can be determined only under 
clearly defined circumstances. There is no general absolute criterion. 

The same can be said for the classification of linear systems into systems 
of constant coefficients and systems with variable coefficients. Take our 
simple examples described by Eqs. (1.1) and (1.6). If the acceleration a 
is very small, i.e., flight at almost constant speed, Eq. (1.8) shows that 
Ymin Will be very much smaller than the initial disturbance yo, and this 
value of Ymin Will occur at a very large value of t. The behavior of the 
system within a finite time interval is thus very similar to a system 
described by Eq. (1.1) with positive k. Therefore the system of variable 
coefficients can be sufficiently accurately approximated by a system with 
constant coefficients under certain circumstances. 

Needless to say, linear systems with constant coefficients are the 
easiest to study. It is fortunate that a very large number of engineering 
systems falls into this classification when the “engineering approxima- 
tion” is made. This is the reason why this particular field of stability 
and control theory is the most developed one. In fact, the present 
theory of servomechanism deals almost exclusively with such systems. 
We shall therefore begin with linear systems of constant coefficients. 


CHAPTER 2 


METHOD OF LAPLACE TRANSFORM 


For linear differential equations with constant coefficients and with 
time ¢ as the independent variable, the method using Laplace transforms 
is particularly useful in finding the solution. Of course the problem can 
be solved by a number of other methods; but the Laplace-transform 
method appeals especially to the engineering scientist in that it reduces 
all problems to a uniform basis. The procedure of solution is then 
standardized, and a general approach is possible. The theory and 
practice of the Laplace transform are discussed in many texts.‘ It is 
not the purpose of the present chapter to do this. The purpose here is 
rather to give for easy reference a summary of results which are useful 
to our discussion in the subsequent chapters. For details and proofs, the 
reader should consult the texts cited. 

2.1 Laplace Transform and Inversion Formula. If y(¢) is a function 
of the time variable t defined for t > 0, the Laplace transform Y(s) of 
y(t) is defined as? 


Y(s) = h ” e-ety(t) dt (2.1) 


where s is a complex variable having a positive real part, Qs > 0. For 
other values of s, the function Y(s) is defined by the analytic continuation. 
The dimension of Y(s) is the dimension of y multiplied by time. 

When. Y(s) is known, the original function for which Y(s) is the Laplace 
transform can be obtained in all cases by the inversion formula: 


rte 
ns =< a etY (8) ds (2.2) 


1 See for instance H. S. Carslaw and J. C. Jaeger, “Operational Methods in Applied 
Mathematics,” Oxford University Press, New York, 1941; or R. V. Churchill, “Modern 
Operational Methods in Engineering,” McGraw-Hill Book Company, Inc., New York, 
1944. For a more complete theory, one should consult G. Doetsch, “Theorie und 
Anwendung der Laplace-Transformation,’’ Verlag Julius Springer, Berlin, 1937; or 
D. V. Widder, ““The Laplace Transform,” Princeton University Press, Princeton, N. J., 
1946. 

2 Throughout this book, capital letters will be used to denote the Taplee transform 
of quantities denoted by lower-case letters. 
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where y is a constant greater than the real part of all the singularities of 
Y(s). The actual evaluation of y(t) can be done by properly deforming 
the path of integration according to the character of Y(s). 

2.2 Application to Linear Equations with Constant Coefficients. 
Since the Laplace transform is defined as an operation on a function 
defined for t > 0, the method is particularly adapted to initial-value 
problems: Given the initial state of the system and the forcing function 
for t > 0, find the “motion” of the system for ¢> 0. Let us consider 
an nth-order system, with coefficients an, Gri, . . - , 4 for the deriva- 
tives, and a nonhomogeneous term, or forcing function, z(t). Then the 
differential equation is 


: ae wee = 
ae + ani apr! T F Qoy a(t) (2.3) 


di”! J i0 s 
(2.4) 


(Y)i=0 = Yo ) 


The differential equation (2.3) together with the condition (2.4) deter- 
mines uniquely the behavior of the system for t > 0. 

To solve the problem by the Laplace transformation, we multiply both 
sides of Eq. (2.3) by e” and integrate from t = 0 tot = œ. Then 


i ” ersty(t) dt = Y(s) (2.14) 


And by partial integration, 


| ens dY a = —Y4y +8 l y(t) dt = — yo + sY (8) 


di 
2 2 
i ee E dt = —yf — syo + Y (8) 
a a e e en (2.5) 
and 
j d™ 
I es a dt = — yp neen syran daei a a a gly + s"Y(s) 


Therefore, if the Laplace transform of the forcing function 2(é) is written 
as X(s), t.e., 


X(s) = i ON (2.6) 
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then Eq. (2.3) together with the initial conditions (2.4) can be written as 


(ans” + ans! + ++ + + as + a) YCS) = anys" 
+ (any F anyo) H (Any? + dary? F anayo) + + +: 
F (any H aryf? Pos + + H aoyo) + X(s) (2.7) 


Hence if we define the polynomials D(s) and No(s) as 


D(s) = ans” + anas"! + + + + + as + ao (2.8) 
and 


No(s) = dayos*? + (Gay + aniy + + 
5 (aryf? T Any” 7 ee ig oY (2.9) 


then the solution of Eq. (2.7) is 


_ No(s) X8) 

We note that the first term of the solution given by Eq. (2.10) depends 
on the initial conditions through Eq. (2.9). No(s) is at most of order 
n — 1 and is thus a lower order than D(s). No(s) will vanish if all the 
initial values specified by Eq. (2.4) vanish. In that case, Y(s) is given 
by the second term alone. The second term depends upon the forcing 
function. Therefore the first term, No(s)/D(s), can be called the comple- 
mentary function, and the second term X(s)/D(s), the particular inte- 
gral. The actual solution y(¢) can be obtained from Y(s) of Eq. (2.10) 
by applying the inversion formula of Eq. (2.2). 

2.3 “Dictionary” of Laplace Transforms. The forcing function z(t) 
is often of a character such that X(s) is the ratio of two polynomials in s. 
Then the complete solution Y(s) given by Eq. (2.10) is also the ratio 
of two polynomials of s. Therefore the expressions for Y(s) can be 
broken down into a number of simple fractions. Each of the fractions 
can be inverted by the inversion formula, or, better yet, the original 


TABLE 2.1 
DICTIONARY OF LAPLACE TRANSFORMS 
Y(s) y(t) 
1/s 1 
1/3” PLT (n) 
l/s — a e% 
a/(s? + a?) sin at 
s/(s? + a?) cos at 
a/(s? — a?) sinh ai 
s/(s? — a?) cosh at 
s/(s? + a)? = sin al 


1/(s? + a*)? os (sin at — at cos at) 
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functions of ¢ can be found by the use of a “dictionary,” a list of simple 
functions of t and their Laplace transforms. A much abridged dictionary 
is given in Table 2.1. 

9.4 Sinusoidal Forcing Function. Theratio of polynomials No(s)/D(s) 
can be broken down into partial fractions. If the roots of the poly- 
nomial D(s) are all different, say 1, S23, . . . , $n, then 


n 


N,(s) », No(s,) i 


De) Ly DG) E= &) 
r=] 
where D'(s) is the derivative of D(s) with respect to s. By “interpret- 
ing” the sum term by term according to our dictionary, the part ye(t) 
of the solution due to the initial conditions, or the complementary 


function, is 
n 


a N o(r) Srt 

Y(t) i y D'(s,) E (2.11) 
In general, the roots s, of D(s) are complex. For physical systems, the 
a’sin D(s), Eq. (2.8), are real; then the s,’s have complex conjugate pairs. 
But if all s,’s have negative real parts, then y(t) will decrease exponen- 
tially with respect to time, and eventually y,(t)-> 0. Then the system 


is stable. 
If the forcing function z(t) is sinusoidal, it can be written as 


z(t) = Ime! (2.12) 


where £m is the amplitude and w is the circular frequency. Then, accord- 
ing to the dictionary, 


Therefore the second term of Eq. (2.10) is now 


Lm, 
(s — iw) D(s) 
We can generalize this to include systems determined by a set of simul- 
taneous equations by putting another polynomial N(s) of order lower 
than n in the numerator. Then the Laplace transform Y,(s) of the 
particular integral is 
N(s) EmN (8) 


Y;(8) = F(s) X (s) = Dey a = (s — iw) D(s) (2.18) 
When N(s) = 1, the problem is reduced to the simpler one specified by 
Eq. (2.10). Now the partial-fraction rule can be applied again. But the 
polynomial in the denominator is now (s — tw)D (s), and the roots are s;, 
Se... , Sa and iw. Thus 
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N(iw) 1 N et 1 
Y;(s) = po D liw) (s — tw) _ ar ee ls, — tw) D (s) D(s,) (s == wos | (2.14) 


Therefore the particular integral y,(¢) due to a sinusoidal forcing function 
of form (2.12) is 


N (iw 
vo me [Me S e] em 


For stable systems, all s,’s have negative real parts. Thus the second 
part of y,(t) vanishes ast— ©. The remaining part is the steady solu- 
tion, and the ratio of steady solution to the forcing function is thus 
simply given by 


(yE) |stenay _ N (tw) 

“l D F (iw) (2.16) 
This equation gives a very direct way of computing the steady-state 
solution under a sinusoidal forcing function. 

When the frequency w of the forcing function decreases to zero, the 
forcing function is reduced to a constant, nonvarying with respect to 
time. Equation (2.16) then indicates that F(0) is the ratio of y to z 
when x is a constant. This is the physical meaning of the value of F(s) 
ats = 0. We shall use this interpretation frequently in our discussions. 

2.5 Response to Unit Impulse. The forcing function x(t) need not 
be a continuous function. It may be a unit impulse applied at the time 
instant t = 0, t.e., 


a(t) = 0 for t #0 
x(t) > œ fort = 0 


h 2@ at =1 


Then the Laplace transform X(s) of the forcing function is simply equal 
to 1. The Laplace transform of the response to the unit impulse of the 
generalized system as described by Eq. (2.13) is simply 


and 


Vile) = Fa 1 = Fle) (2.17) 


The solution y( due to this unit impulse is usually denoted as A(t). 
According to our inversion formula, Eq. (2.2), 


1 ytie 
ale ft st 
h(t) 5G eF (s) ds (2.18) 
When the system is stable, the roots s, all have negative real parts. 
Then the singularities of F(s) all lie to the left of the imaginary axis of 
the complex s plane. Then the imaginary axis can be taken as the path 
of integration for the A(t), that is, y in Eq. (2.18) can be set equal to zero. 


CHAPTER 3 
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We have seen in the previous chapter that by the application of the 
Laplace transform, the behavior of a linear system with constant coeffi- 
cients is made to depend essentially on the polynomial D(s), given by 
Eq. (2.8), formed out of the coefficients of the differential equation. 
Even in the generalized case, if the initial values of y(t) and the initial 
derivatives necessary to specify the problem are all zero, the behavior of 
the system is completely determined by the ratio N(s)/D(s) of two poly- 
nomials. This ratio is denoted as F(s). If the Laplace transform of the 
forcing function is X(s) and the Laplace transform of the particular 
integral is Y,(s), Eq. (2.13) gives 


Y,(s) = F(s)X(s) (3.1) 


This equation can be considered as an operator equation: X(s) when 
operated by F(s) gives Y;(s), or F(s) transfers X(s) into Y,(s). There- 
fore F(s) is called the transfer function. X(s) is the Laplace transform 
of the input x(t), and Y;(s) is the Laplace transform of the output y;,(t). 
In order to specify the fact that y:(t) implies the particular integral only, 
without the complementary function introduced by the initial conditions, 
y;(t) is called the output due to input. Then the complementary function 
Y(t) is called the output due to initial conditions. 

The advantage of the Laplace transform method is thus to reduce a 
problem in differential equations to one of algebraic operation. The step 
of going from Y(s) to y(t) is seldom necessary, because the behavior of 
y(t) is fully determined by Y(s). Thus it is possible to translate the 
engineering requirements on y(t) to a set of requirements on Y(s) or, 
with the input characteristics specified, to a set of requirements on F(s), 
the transfer function. The study and design of a system by means of 
the transfer function are the fundamental technique in servomechanism 
engineering. In this chapter, we shall expound this technique by a 
series of examples. 

3.1 First-order Systems. As a first example, consider a cantilever 
spring which has a dashpot at one end and a sliding movement on the 
other end (Fig. 3.1). The position of the end with the dashpot is denoted 

12 
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by y(t), the position of the sliding end by z(t). Because of the dashpot, 
y(t) is not equal to z(t) but lags behind z(t). The problem is to study 
the output y(t) when the sliding end is made to describe a specified 
motion. x(t) is thus the input. 

Let k be the spring constant and 
c be the damping coefficient of the 
dashpot. Then if the motion is 
slow enough for the inertial forces 
to be neglected, the equilibrium of 
forces requires 


dy 2 
If we specify a characteristic time 7; as 
Ti = I (3.2) 


Then the equation of motion can be written as 


n% + y =r (3.3) 
The initial condition is simply 
y(0) = yo (3.4) 


By multiplying Eq. (8.3) by e~ and integrating from t = 0 tot = ©, we 
have the transformed equation 


(ris + 1)V(s) = X(s8) + Tyo 


Therefore 
pE X (s) T1Y0 
V(s) 7 Tis -+ l 718 + 1 (3.5) 
Hence the output due to input is given by 
1 
Y,(s) = a Fit (s) (3.6) 
and the output due to the initial condition is given by 
_ To 
Y.(s) = ES (3.7) 
The transfer function F(s) is thus 
1 
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Equation (3.6) can be represented graphically as shown in Fig. 3.2. 
This simple visual aid is very helpful in picturing the situation and is 
generally called the block diagram. 


; TG) 
x(t) 
1 
X(s)-9 F(s)= -——— p> Y; (8) 
roaren rie ne 0 
Frc. 3.2 Fie. 3.3 


Let us investigate the output for a few special cases of the input z(t). 
Consider first the case when z(t) is the unit step function 1(¢) shown in 
Fig. 3.3. Then 

x@) = [eva =} 
0 $ 


and 


] l l 
Pls) = tD 37 EFO 


Thus the output due to input is, according to our dictionary, Table 2.1, 


1) = ee) (3.9) 
The output due to the initial condition is, Eq. (8.7), 
Yht) = yoo (3.10) 


These output characteristics are shown in Fig. 3.4. Thus the output 
due to the initial condition is a pure 

1 subsidence with the characteristic time 

tı The output due to input is an ex- 

ponential approach to the asymptote, 

again with the characteristic time 7}. 

(a) Infact att = 71, the output y,(t) reaches 

63 per cent of the asymptotic final value. 

M The error signal e(t), defined as the 
difference between the input x(t) and the 

Yel(t) output y;(¢) is, for the case under study, 


elt) = x(t) — y(t) = em (8.11) 


(b) Therefore the error signal vanishes as 
Fra. 3.4 i— ©, 
Consider now another example of the input. Let the input be sinus- 
oidal, or 
a(t) = Lmt 
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where £m is the amplitude and w is the frequency. Then 


Xa = (3.12) 


— s = iw 
The output due to the initial condition is the same as before. The out- 
put due to input is given by 


I T 1 1 
Vi(s) = am (s — ta)(rig +1) 1 + wry ( $ + (1/71) T s— =) 
Therefore, according to our dictionary, the output y;(¢) is 


git 


GEES Lm ~—t/T | Lm 
y(t) = 1 + iwr a a 1 + wr 


The first term is a pure subsidence, and the second term is the steady- 
state output. Thus 
[y (É) lsteaay = l eo : 
a(t) ~ Lt tery (1) 
This is in full agreement with our general result given in Eq. (2.16). 


since 
l 1 


L+ iorn 4/1 + or? 


the steady-state output can be expressed as 


gitan ory (3 f 13) 


Tm 
y(t) \teaay = —————_——= 
Olas = ae 


e? (wt—tan“lwr1) 
Therefore the amplitude of'the steady-state output is reduced by the 
factor 1/4/1 + 7? in comparison with the input, and the phase of 
the output lags behind the input by the amount tan7! wr; For low- 
frequency inputs, 

ly (É) letenay  Tme@ ov Tw KI (3.14) 


That is, the amplitude is not modified, but there is a time lag equal to 
the characteristic time 7, of the transfer function. For high-frequency 
inputs, 


[y (E) letenas © = giad) tw >> 1 (3.15) 


Then the amplitude is reduced by the factor 1/wr:, and the phase lags 
by 7/2. These characteristics of the output are shown in Fig. 3.5. 

3.2 Representations of the Transfer Function. The transfer function 
F(s) is a function of the complex variable s. Since it is generally the 
ratio of two polynomials in s, the function F(s) is determined up to a 
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constant by the zeros and the poles of F(s). This constant can be fixed 
by knowing the value of F(s) at any particular s. The most convenient 


sis the origin. In fact, 
| [F(0)| = K (3.16) 


has a physical meaning: it is the ratio of output to input with a constant, 
non-time-varying input. K is actually called the gain of the system. 
Therefore the transfer function F(s) is uniquely determined by the gain, 
the zeros, and the poles. This is one possible representation of the 
transfer function. For example, the simple transfer function specified 
by Eq. (3.8) has a gain of unity, a simple pole at —1/r:, and no zero. 


x(t} 


[yte] 


Ly ( t ar 


Tih 


Tyo <1 Tye >] 
Low Frequency High Frequency 


(a) (b) 
Frc. 3.5 


Tf both the real and the imaginary parts of F(s) along the imaginary axis 
of the s plane are given, then by the principle of analytical continuation, 
F(s) is essentially determined for any s. Therefore another possible 
representation of F(s) is the complex function F(iw), where w is real. 
For physical systems, the coefficients in the numerator polynomial N (s) 
and the denominator polynomial D(s) of F(s) are all real. Then if we 
denote the complex conjugate of F by Ë, 


F(~iw) = FG) | (3.17) 


Therefore for physical systems, knowledge of F(tw) for w > 0 will be 
sufficient for the determination of F for any s. But we know from Eq. 
(2.16) that F(iw) is the ratio of steady output to the sinusoidal input 
of frequency w. F (iw) for all values of w is called the frequency response 
of the system. Therefore the frequency response is another representa- 
tion of the transfer function. The frequency response of our simple 
first-order system is given by Eq. (8.18). 
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One way to present the frequency response was devised by H. W. Bode 
and is called the Bode diagram. Let the magnitude of F (iw) be M and 
the argument be 0, 2.e., 

F(tw) = Me’ (3.18) 


The Bode diagram consists of the plots of log M and 0 against log w. 
The choice of a logarithmic scale for M but not for 0 will be made mean- 
ingful by later discussions. For our simple system of Eq. (3.13), 


i l 
Se ee 
Vitor Sltuw u= (3.19) 
and 0 = — tan! or; = — tan! u 


where u is the dimensionless frequency. The Bode diagram of this 
system is shown in Fig. 3.6. The behavior of the frequency response 


0 
0 
-T 
2 
log you 
] 
b= 
ane 1+iwr, 
Fie. 3.6 


at low and at high frequencies is that already indicated by Eqs. (8.14) 
and (3.15). As u— œ, the graph of logy M against logio u has the 
slope —1. For small values of u, the slope is nearly 0. Therefore the 
M ~ u diagram for a first-order system can be approximated by two 
straight lines. 

In acoustical and electrical literature, itis customary to plot 20 logis M 
instead of logio M in order to convert the amplitude units into decibels. 
A doubling in frequency is called an octave, and thus the region of the 
plot in Fig. 3.6 where the logio M curve has a slope of —1 is described 
as a region of slope —20 logio 2 = —6.02 db per octave. We note also 
that on the same plot, the approximate logiy M line goes through 0 at 
u = 1, that is, at w = 1/rı. Therefore we can measure the frequency 
response of a first-order system and plot the measurement as indicated. 
The characteristic time rı of the system can easily be estimated by 
noting the frequency at which a straight-line approximation for large 
-values of w crosses the horizontal axis. 

Another way. to present the frequency response was devised by H. 
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Nyquist and is called the Nyquist diagram. Here the complex quantity 
F(tw) or 1/F (iw) is directly plotted in the complex F or 1/F plane. The 
parameter of the curve is the frequency w. For a simple first-order 
system, F(iw) is a semicircle, starting at 1 for w = 0, going through 
1/1 +4) = (1/+/2)(1 — 2) at wr: = u = 1, and ending at the origin 
for w— œ. The 1/F diagram is much simpler: 1/F = 1 + wr, and 


1+ iz 


l dis 
1+ iu F 
Fra. 3.7 
thus the diagram is simply a straight line parallel to the imaginary axis. 
These Nyquist diagrams for the first-order system are shown in Fig. 3.7. 
3.3 Examples of First-order Systems. There are many elements of a 
complex system that can be approximated by a first-order transfer func- 
tion. We shall briefly discuss a number of examples of such elements 
in this section, together with the proper diagrams for their frequency 
responses. 
Integrator. An electric motor whose speed d¢/dé is proportional to 
an input voltage v follows the equation 


TsK (3.20) 


where K is a scale factor. Thus the angular position ¢ of the rotor of 
the motor is proportional to 


[va 


This relation is represented by the 
block diagram in Fig. 3.8 with the 
Laplace transforms V(s) and &(s) for v and ¢. This transfer function 
F(s) = K/s is the limiting case of the function 1/(rs + 1) when 7, > © ; 
and it is represented by a simple pole at the origin. In order to consider 
the constant K as the gain of the transfer function F(s), we have to 
modify the definition first introduced in the previous section. The defi- 
nition there given is suitable for transfer functions having no zero or 


¢(s) 
$ (t) 


Fra. 3.8 
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pole at the origin. The gain K of an integrating system, i.¢., a system 
whose transfer function F(s) has a simple pole at the origin s = 0, should 
be defined as 

K = lim |sF(s)| (3.21) 


30 


The frequency response is 


Piia) = É = (E) cw 


a) 


x 
= 2 = -3 (3.22) 


The Bode diagram is thus that shown in Fig. 3.9, and the Nyquist dia- 
gram is that shown in Fig. 3.10. 


logioM 


lOZ 104 log 19 


Fliu)=# 


Fic. 3.10 


Differentiator. A rate gyro gives a voltage output v proportional to 
the angular velocity d@/dt of the precession axis, i.e., 


_ Kg? 

p= Be 
where K is the factor of proportionality. This case is the inverse of 
the preceding one. The transfer function F(s) = Ks has a zero at the 
origin. Thus the gain of a differentiating system, i.e., a system whose 
transfer function F(s) as a simple zero at the origin s = 0, should be 
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defined as 
F(s) 


: 8 
K = lm py 


s—0 


(3.23) 


The block diagram is shown in Fig. 3.11, the Bode diagram in Fig. 3.12, 
and the Nyquist diagram in Fig. 3.13. 


(3) | V(s) Simple Lag Network. Consider 
$(t) da v(t) the resistance’ and capacitance net- 
Fie. 3.11 work of Fig. 3.14. If71s the current 


flowing in the resistance & and the 
capacitance C and if there is no charge in the capacitance at t = 0, then 


t 
ik +3 | Oat =o 
0 


1 t 
T i a(t) dt = V9 
By multiplying these equations by e~* and integrating from t = 0 to 


T 
log M 2 
9 
lOg ig 10819 W 
F(iw)=Kiw=Me® 
Fig. 3.12 
w=0 W = OO 

F(iw)=Kiw 1 _ 1 

Fliw i 
Fre. 3.13 


t = ©, we have 
l 
(z + as) J (s) = Vi(s) 


a T (8) = Vols) 


Therefore 


(3.24) 
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Hence the transfer function of this RC circuit is the same as the canti- 
lever spring with dashpot, and the characteristic time ist, = RC. The 
Bode diagram and the Nyquist diagram are thus given by Figs. 3.6 and 
3.7. This circuit is frequently used in order to introduce a phase lag 
into a system. 

R Ry >} 


Fic. 3.14 Fie. 3.15 


Lead Network. A more complex circuit is that shown in Fig. 3.15. 
The controlling equations are 


j = Jı + Je 
1 fi 
Raji = a I Jat) dt 


and 
v = Raji F fej 
Va = Re 
The corresponding transformed equations are 
J =Ji +J: 
Rhi =gh 
and 
Vi = Rda + Rol 
V, = Rot 
Therefore 
V2(s) = F(s) = R: + R RCs 
V(s) (Ry -+ Rə) +- R RCs 
Hence the gain is 
Ro 
K = R, + R, =r (3.25) 


and it is necessarily less than unity and is generally between 0.1 and 1. 
If we introduce the symbol w; as 


_ ky + ke 
Qi = RBC (3.26) 
then the transfer function can be written as | 
F(s) = „ lt (s/rer) (3.27) 


1 + (s/w) 


Therefore the transfer function has a zero at —rw; and a pole at — w. 
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The frequency response is then 


One mte (3.28) 


If we introduce the nondimensional frequency u as 


l aw 
_ tle 3.29 
u a (3.29) 
then 
M =r mA 0 = tan”! F — tan (yru) (3.30) 
Hence 


> ee 
logio M(u) = logu yr + logio eee 


_ Lae (eu?) 
= logio a/r — logio OA + jw) 


6(u) = 19 


The Bode diagram has thus a certain symmetry with respect to u = 1, 


and 


0 OZ ou 0 logiou 
ue OE oe 
F(i )= wW ti fr Wy 
Fre. 3.16 


as shown in Fig. 3.16. The maximum value of 6 occurs at u = 1 and 
is equal to 


l 
imax = tana! Va — tan! = = — 2 tan 4/r (3.31) 


Therefore this circuit gives a considerable phase lead over a band of 
frequencies. For very large w, M = 1. For very small w, M = r. 

Restricted Lag Network. The RC circuit shown in Fig. 3.17 has the 
following transfer function: 


V2(s) = (8) = ji + RCs 
Vils) — 1+ (Ri + BCs 
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The gain of the system is thus unity. If we introduce the parameters w 
and r defined as 


L 1 _ R; Ca a 
o= Ba B meg IP (3.32) R c | 
then the transfer function is | FS | 


F(s) = m (3.33) Fre. 3.17 


By comparing this equation with Eq. (3.28) for the lead network, we 
see that the two circuits have transfer functions that are reciprocal to 
each other. In fact the frequency response in the present case can be 
written as 

1 +ilo/o) ltivru 

1 +i (w/rw) 1+ill/vyvr)u 


where u, the dimensionless frequency, is 


F(iw) = 


= — — (3.34) 
Thus 
(1/r) + u? ee oe E 
M =r eee 0 = tan™! (wr u) — tan a (3.35) 


The corresponding Bode diagram is thus that shown in Fig. 3.18. There 
is a phase lag for a range of the frequency. The maximum phase lag 


logo Vr 
PE DA 
log iow 0 logiou 


Itig lw 

F(iw)= L u=- 

! itin VP 
Fre. 3.18 


occurs at u = 1, or w = 4? w, and its magnitude is given by Eq. (3.81). 

Simplified Rolling Motion of an Airplane. Let I be the moment of 
inertia of the airplane about its longitudinal axis, ọ the roll angle, Lp 
the aerodynamic damping due to roll, and kô the torque applied by 
the aileron deflection ô. The equation for the roll angle ¢ is thus 


Ë% dp -kô 


Lag t ley 
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Now let p = do/dt be the roll speed; then the above equation becomes 


If the roll speed is zero at £ = 0, then the transformed equation is 
(Is + Ly)P(s) = k A(s) 
The transfer function F(s) is thus 


P(s) _ k k l 

o eee e Gi 
The behavior of the system, as determined by the transfer function, is 
thus similar to the cantilever spring with dashpot and the simple lag 
network. Here the characteristic time 7, is //L,. If the damping is 
very small, then rı > œ% and the behavior of the system becomes that 
of the simple integrator. 

3.4 Second-order Systems. Let us return to the cantilever spring 
with a dashpot (Fig. 3.1). But now we attach a mass m to the dashpot 
end. The mass will introduce an inertial force m d*y/dt?, and the equa- 
tion of motion 1s now 


m 72 T s% 4 ky = kz 
with the initial conditions 
y(0) = yo 
(2) = D | (3.37) 
dt Ji=0 Yo 


The differential equation of motion can be rewritten in a more convenient 
form by introducing the following parameters: 


soi (3.38) 


wy is thus the natural frequency of the mass-spring combination when 
the dashpot is absent. ¢ is the ratio of actual damping to critical damp- 
ing. The meaning of this nondimensional parameter will be made clear 
presently. Then the differential o becomes 

A Ar o + WY = wee (3.39) 


Equation (8.39) together with the initial conditions of Eq. (8.387) can be 
converted into the following relation in terms of Laplace transforms: 


(s? + 2Wwos + o) Y (s) = wX (s) + yP + (8 + Feo) yo 
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The output due to initial conditions is then given by 


_ Yas + (YP + 2twoyo) 


Y,(s) = #  Qfaes + ot (3.40) 
The transfer function is then 
F(s) = SN ea a tae (3.41) 


X(s) — (8/wo)? + 26(8/wo) + 1 


The transfer function thus has a gain K = 1 and no zeros. It has 
two simple poles at 


= — pt Vo] | 
: e>1 (3.42) 
a —p- Vea 


When the damping coefficient of the dashpot is smaller than the critical 
damping, the value of ¢ will be less than unity. In that case, the poles 
sı and s: will be complex conjugates, having real and imaginary parts 
à and v, respectively: 


Si/wo = —ti +i V1 —- = (A + tv) /wo = e 
s/o = =t- ivl- P =A tv) /wo = enti 


where the last expression is possible because the absolute value of either 
Si/wo OF 82/w is one. For positive damping, à is a negative number. 

The output y(t) due to initial conditions can be easily determined 
from Eq. (3.40). Thus, for ¢? < 1, the poles of the transfer function are 
given by Eq. (3.43), and we have 


<1. (8.43) 


(1) = 
y(t) = a e^ sin vt + Yo e cos vt + = Yoe sin vt (3.44) 


Since à is a negative number, the output is damped, but nevertheless 
it is a damped sinusoidal function. If, on the other hand, ¢? > 1, then 
the output is a pure subsidence. Thus for dampings greater than the 
critical value, there is no oscillation in the output y,(t). This is the 
meaning of critical damping. 

Now let us assume that the input z(t) is the unit step function 1(é) 
shown in Fig. 3.3. Then X(s) = 1/s, and for ¢ < 1, 


2 
s[(s — A)? + 7?] 
The output y;(¢) due to input is then 


y = 1 — co vt + ( >) sin n eM (3.45) 


Y,(s) = 
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When ¢ > 1, the output is not oscillatory and is calculated as 


1 esl es 
á eal ee ace et many 


where sı and s: are given by Eq. (3.42). Such behaviors of the output 
y,(t) are shown in Fig. 3.19 for various values of the damping ratio ¢. 
It is seen that for a quick approach to the asymptotic value, ¢ should 
not be too large. On the other hand, if ¢ is too small, there will be rather 


1.6 
1.2 
Sos ea le Ser > = 
a 

~ 4 
= 08 
0.4 
0 

0 2 4 6 8 10 
Woe 
Frc. 3.19 


persistent oscillations with high peaks. Here an engineering compromise 
has to be made, and the usual practice is to select a damping ratio ¢ 
between 0.4 and 1. 
If-the input is a sinusoidal oscillation with amplitude z,, and frequency 
w as specified by Eq. (8.11), then 
2 


$ £ w 
Y,(s) = ——- F(s a e pee co 
i(s) s — Ww (s) S — tw 8? + Zrans + ow} 


Therefore the output y,(t) due to input is, for ¢? < 1, 


pee -oit em OD tint 
YD) = Saf ae + 2iv A + iv — w) : 
om _ Ob it 
2iv A — ilv — w) k (9H) 


where \ and v are given by Eq. (8.43). Since d is negative for positive 
damping, the steady-state output is again simply given by the first 
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f=0.4 | 


+1 
l0g10 (ar) 
® 
Ber 0 F +1 
logs (aig) 
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term of Eq. (3.47). This is in agreement with our general result of 
Eq. (2.16). 

The frequency response of our second-order system is, according to 
Eq. (8.41), 


Papa Ne = ir a T 
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Therefore 
M = ———eeeee 3.48 
Vl i (w/ao)*}? F [25 (w/wo)]? 
and 
26 (w/wo) 


tan 0 = — (Cie CER. 


The corresponding Bode diagram is shown in Fig. 3.20. The maxima 
of M occur near w/w) = 1, where M ~ ṣ¢ and @ ~ —r/2. As w/w %, 
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6— —r and M ~1/(w/w)*, or log M ~ —2 log (w/w). The acous- 
tical engineer will then say that the slope for high frequency is —12.04 db 
per octave. 

The Nyquist diagram for our second-order system is shown in Fig. 
3.21. 

Other physical systems can usually be approximated by a second-order 
transfer function. The hydraulic servo system is one example. A better 
approximation to the behavior of the rate gyro discussed in Sec. 3.3 
is the transfer function 

Ks 
rae 
9 = Jo F AFi 


This more accurate transfer function should be compared with that given 
in Fig. 3.11. The transfer function for an accelerometer is 
_ Ks? 

(s/w)? + 26(8/wo) + 1 
The electric motor used as an integrator has a more accurate transfer 
function 


F(s) 


K 


iC Ra Ti 


This transfer function should be compared with the previous crude 
approximation given in Fig. 3.8. All these transfer functions have a 
second-order polynomial in the denominator. The constants in them 
have meanings similar to those in the examples discussed previously. 

3.5 Determination of Frequency Response. In the discussions of the 
previous sections, we have considered the problem of knowing the struc- 
tural details of a system and of calculating the transfer function F(s) 
and the frequency response F(iw) by elementary physical laws. This 
procedure of determining the frequency response is thus theoretical, and 
its accuracy depends upon the accuracy of our knowledge of the system. 
Very often in engineering practice, our knowledge of the detailed struc- 
ture of the system is incomplete, or if sufficiently complete, the system 
is so complicated as to make the theoretical calculation of the frequency 
response too lengthy to be practical. In such cases, it is often necessary 
to determine the frequency response experimentally. The simplest 
method conceptually is to utilize the fact that the ratio of steady output 
with a sinusoidal input of frequency w to the input is equal to the fre- 
quency response F (tw) as shown by Eq. (2.16). The ratio of the amplitudes 
of output and input is M, and the phase difference between output and 
input is 0. Therefore this experimental method involves the determina- 
tion of amplitude ratios and phase differences for a number of frequencies 
w in the desired range. It has been applied to problems varying from 
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such a relatively simple system as a fuel pump! to as complicated a 
system as the longitudinal motion of a complete airplane.” The draw- 
back of this method is the lengthy experimentation generally required 
for a wide range of frequencies. Sometimes it is also difficult to deter- 
mine the phase difference between the output and the input. 

A more efficient method is to excite all frequencies simultaneously 
instead of individually. The best method for doing this is to use a 


x(t) x(t) 


T t 0 ek t 
Rectangular Pulse Triangular Pulse 


Fic. 3.22 


unit impulse as the input. Then, according to Eq. (2.18), for stable 
systems (y = 0), 


TE + a Fli des 


: i [RF (tw) cos wt — JF (iw) sin wt] dw (3.49) 


T JO 


0 


where & and g denote the real and the imaginary parts, respectively. This 
last step.is possible because of Eq. (3.17). Equation (8.49) shows that 
an input impulse excites all the frequencies of the system uniformly. 
When the response h(t) of the system to a unit impulse is determined, the 
frequency response can be computed as 


Fliw) = i. POT. (3.50) 


This integration can be carried out numerically for a number of frequen- 
cies. 

Practically, however, it is dificult to make the input be an impulse. 
The more practical inputs are a single rectangular pulse and a single 
triangular pulse, as shown in Fig. 3.22. Such inputs will not excite 
all frequencies uniformly. But if we make the length r of the pulse 
small, the ideal uniform excitation can be approached. This method of 
pulse excitation has been applied to determine the frequency response of 
an airplane by R. C. Seamans and his coworkers.? They have also 


1H, Shames, S. C. Himmel, D. Blivas, NACA TN 2109 (1950). 

2 W. F. Milliken, J. Aeronaut. Sci., 14, 493 (1947). 

3 R. C. Seamans, B. P. Blasingame, G. C. Clementson, J. Aeronaut. Sci., 17, 22 
(1950). 
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developed an approximate method for computing the F(iw) from the 
measured output y(t). The method of data reduction has been general- 
ized to arbitrary inputs by H. J. Curfman and R. A. Gardiner.’ 

3.6 Composition of a System from Elements. The systems studied 
in Secs. 3.1, 3.3, and 3.4 are really only elements in the much more 
complicated system generally found to be necessary in stability and 
control engineering. Take the example of the rolling motion of an air- 
plane. The signal to move the aileron is usually in the form of an electric 
current. This signal is the input to an “amplifier” and computer group, 
which is a rather involved electric circuit and may contain vacuum tubes. 
The behavior of the amplifier and computer is determined by its transfer 
function F(s). The output of the amplifier and computer is then taken 
to be the input to the hydraulic servo which moves the aileron. The 
behavior of the hydraulic servo is specified by the servo transfer function 


X(s) F(s) > Ris) þ----> F(s) Pe---- ¥(s) 


Fig. 3.23 


F(s). Finally, the output of servo, the aileron motion, is taken as the 
input to the system representing the airplane dynamics. The airplane 
dynamics gives the transfer function F(s). The output of the airplane 
dynamics is the rolling motion. Here we have a series connection 
between the various elements of the system from rolling signal to rolling 
motion. If the rolling signal is denoted by x(t) and the roll angle by 
¢(t), then the Laplace transforms are related by 


g(s) = Fs(s)Fo(s)F1(s)X(s) 


Therefore the over-all transfer function of the roll-control system is the 
product F1(s)F'2(s)F3(s). This example also illustrates clearly the fact 
that a transfer function is generally dimensional: it is the ratio of two 
quantities of different dimensions. The input, or roll signal, is an electric 
current; and the output, or roll angle, has the dimension of an angle. 

In general, then, if a system is composed of individual elements of 


transfer functions F1(s), Fs(s),..., Fe(s), .. . , Fx(s) with gains Ky, 
Ky, ..., Ky, ..., Ka and if the elements are in series (Fig. 3.23), 
then the over-all transfer function F(s) is the product 
F(s) = Fi(s)Fe(s) + + + F,(s) + + + Fas) (3.51) 
The gain K of the system is then 
K = KiKk.:-: K++ Ky (3.52) 


From Eq. (3.51), it is evident that the system transfer function F (s) has 
the totality of the zeros and poles of the individual elements. This fact 


1H. J. Curfman and R. A. Gardiner, NACA TR 984 (1950). 
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together with the gain K, computed by Eq. (8.52), completely determines 
the transfer function F(s). 

The frequency response of the system is F(iw) = Me’. If the fre- 
quency response of rth element is M,e’? then, according to Eq. (8.51), 


Mei! = (Met) (Mt) LAUA (M e’) Stet n (M net) 
Therefore 


logio M = log 10 Mı + logio M+: + logio M, +> 
Sa logio Mn (3.53) 


and 0= b +0 tH °°: eHe 


Equation (3.53) gives the reason for the choice of a logarithmic scale 
in the Bode diagram. This choice makes the work of finding the system 
characteristics easier by requiring only simple addition of the ordinates 
of individual diagrams. 

3.7 Transcendental Transfer Functions. The Laplace-transform 
method is applicable not only to initial-value problems of linear ordinary 
differential equations with constant coefficients, but also to linear partial 
differential equations! with coefficients that are independent of the time 
variable ¢, and with boundary conditions partially described as initial- 
value conditions int. By applying the Laplace transform to the original 
partial differential equation, the time variable tis removed, and in its place 
a parameter s appears. The resultant equation is a linear differential 
equation with respect to the remaining independent variables and can be 
solved as such by using the remaining boundary conditions. The proce- 
dure involved here is evidently much more complicated than in the case of 
ordinary differential equations discussed in Chap. 2. On the other hand, 
if any two specific quantities in the solution of the transformed equation 
are compared, they still bear a linear relation. If one of them is con- 
sidered to be an input and the other an output, the ratio of output to 
input is still a function of the parameter s and can still be considered as 
the transfer function F(s). There is, however, one difference: the trans- 
fer function is no longer the quotient of two polynomials of s. It is, in 
general, a transcendental function in s. 

For instance, for two-dimensional flows, W. R. Sears? has calculated 
the effects of a small vertical “gust” velocity v in the fluid on an airfoil 
of chord c in a stream moving horizontally with a uniform velocity U. 
Let v vary sinusoidally with respect to x, the horizontal coordinate, and 


1 See for instance H. S. Carslaw and J. C. Jaeger, “Operational Methods in Applied 
Mathematics,” Oxford University Press, New York, 1941; or R. V. Churchill, ‘ Modern 
Operational Methods in Engineering,” McGraw-Hill Book Company, Inc., New York, 
1944. 

2 W, R. Sears, J. Aeronaut. Sci., 8, 104 (1941). 
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t (Fig. 3.24), so that 
v(x t) = amy ee e0 (3.54) 


where a, is the amplitude and w the “frequency.” For this gust velocity, 
Sears has shown that the lift coefficient C1, that is, the average lift force per 
unit area of the airfoil divided by the “dynamic pressure” dpU? (p is 


U 


(x. Ut) Pa aa 


Fie. 3.24 
the density of the fluid), is given by 
Cy = Qrame™te(k) (3.55) 
where 
we 
c= JÜ (3.56) 
and 


g) = TK Gik) + (k Kolik) 
we Ky (tk) + Kolik) 


The J’s and K’s in Eq. (3.57) are the Bessel functions of the first kind 
and the modified Bessel functions of the second kind, respectively. 
Therefore, if we take X(s) as the Laplace transform of v(0,t), the input, 
and Y(s) as the Laplace transform of C,(t), the output, then the transfer 
function F(s) is 


(3.57) 


Y(s) _ 
Xe) F(s) 
and the frequency response F (iw) is 
Flia) = a plk) (3.58) 


The frequency response is thus a transcendental function. 

The application of such concepts as the transcendental transfer func- 
tion and frequency response to the problem of flutter of airplane wings 
has been demonstrated by J. Dugundji.! 


1 J. Dugundji, J. Aeronaut. Sci., 19, 422 (1952). 
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FEEDBACK SERVOMECHANISM 


In this chapter we shall introduce the central concept of modern stabil- 
ity and control engineering: the concept of feedback. We shall intro- 
duce this concept by discussing the simplest systems—linear systems 
with constant coefficients. We shall show how the feedback can greatly 
increase the degree of accuracy in control and the rapidity of response to a 
signal. We shall then explain the principles of designing such feedback 
servomechanisms for stability and for optimum performance. 

4.1 Concept of Feedback. Let us consider the problem of controlling 
the rotational speed of a turboalternator. The primary purpose here 
is to keep the speed very close to the normal value. The most ele- 
mentary approach to this problem would be the so-called open-cycle 
control, where we try to balance the torque generated by the steam 
turbine and the torque absorbed by the alternator, the load torque. 
This could be done by measuring the load and by opening the steam 
throttle accordingly. However, it is to be expected that such balancing 
cannot be perfect; there is always an error torque, z(t). This error 
torque tends to accelerate the machine. If we denote by y(t) the speed 
deviation from the normal, by J the moment of inertia of the rotating 
parts of the machine, and by c the 
damping due to the windage loss, 


l : : : ‘ 
F,(s)= then the differential equation is 


X(s) Is+e Y(s) 


Fic. 4.1 i + cy = a(t) (4.1) 

The block diagram of this system is shown in Fig. 4.1. It is seen that 
the system is the familiar first-order system studied in the last chapter, 
that the characteristic time of the system is J/c, and that the ratio of the 
steady-state value of speed deviation to the error torque is 1/c. Now 
I is a very large quantity because of the heavy weight of the rotor of a 
turboalternator, but c is a very small quantity because of the small 
windage loss. Hence the characteristic time is extremely long. This 
means that any speed deviation will persist and be difficult to remove. 
Furthermore, for small speed deviation, the error torque has to be 


extremely small, because of the large magnification factor l/c. Need- 
34 
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less to say, this system of keeping the turboalternator at constant speed 
is quite useless in practice. 

Now consider the change in performance caused by changing the system 
to the so-called closed-cycle control. In the closed-cycle control, we 
make the control torque depend upon the controlled variable. That is, 
we cause the steam-throttle opening to depend not only on the load but 
also on the speed deviation y. Let the second component have a factor 
of proportionality —k. When the speed is too high, or y > 0, then the 
throttle is closed, and the accelerating torque is reduced by the amount 
ky. When the speed is too low, the accelerating torque is increased 
by the amount ky. Thus the differential equation for y is now 


r+ (e + Ry = zC) (4.2) 


The only difference between Eqs. (4.1) and (4.2) is the replacement of c 
by the sum c + k. Hence the characteristic time is now J/(¢ + k), and 
the ratio of steady-state speed deviation to the error torque is 1/(c + k). 


Feedback 
Fie. 4.2 


Therefore, in comparison with the open-cycle control, we can greatly 
reduce both the characteristic time and the speed error by making k 
very much larger than c. But this can be accomplished quite easily, 
because cis so small. Therefore the closed-cycle control can be designed 
for quick response and for accurate control and thus achieve a great 
improvement in performance. 

The block diagram of the closed-cycle control can be drawn as Fig. 4.2, 
retaining the intrinsic transfer function of the turboalternator shown in 
Fig. 4.1. In Fig. 4.2, we have introduced a convention in servomecha- 
nism engineering: addition or subtraction has to be specifically indicated 
at the symbol for the mizer. If at the junction of two links only a dot is 
used, no addition or subtraction occurs. The quantity is only “meas- 
ured.” Thus the speed deviation y is measured at the output side and is 
used to generate the control torque. It is seen from Fig. 4.2 that the 
closed-cycle control involves a feedback link. The whole system is thus 
appropriately called a feedback servomechanism. 

Although for the simple example analyzed above the advantage of a 
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feedback servomechanism can be shown by comparing the differential 
equations (4.1) and (4.2), for more complicated systems the analysis can 
be conveniently carried out only by the concept of transfer functions. 
This method is expounded in the following sections. 
4.2 Design Criteria of Feedback Servomechanisms. Let us consider 
a general feedback servomechanism with arbitrary transfer functions F(s) 
and F(s), similar to that represented by Fig. 4.2. Fı(s) is called the 
transfer function of the forward circuit and F.2(s) the transfer function 
of the feedback circuit. Then the input X(s) and the output Y(s) are 
related as follows: 
Y(s) = Fi(s)[X(s) — F2(s) ¥(s)] 


By solving for Y(s), we have 


Y(s) _ F(s) 
X(s) 1 +F(9)F:(8) 


where F,(s) is thus the system transfer function, or the output-input 
ratio of the complete system. 

It will be convenient for later discussions to indicate explicitly the 
gains K, and K, of the transfer functions F(s) and F(s). Thus we write 


F(s) = K:G(8) 
F(s) = K2G2(s) 


It is evident that G(s) is nondimensional; the dimension of the transfer 
function is all absorbed into the gain K. All information about the 
“structure” of the transfer function is contained in G(s), that is, G(s) 
gives the zeros and the poles of the transfer function. In the subsequent 
discussions, we shall usually think of the effect of the transfer function 
on the performance of the system as the result of two separate influences: 
the influence of the locations of the zeros and the poles of the transfer 
function, t.e., the influence of G(s); and the influence of the magnitude 
of the gain K. This separation of effects is further justified by the fact 
that the structure of the transfer function G(s) is controlled by the com- 
puter element of the amplifier-computer group in the composite system 
of F(s). The gain is controlled by the amplifier element of the amplifier- 
computer group. Moreover, these two controls in the design can be 
affected almost independently of each other. Therefore G(s) and K can 
indeed be modified separately and can be considered as separate. 

By using Eq. (4.4), Eq. (4.3) can be written as 


= F,(s) (4.3) 


(4.4) 


Y (s) e _ K,Gi(s) _ 1 
XQ)" S IFERORGO EAO FEAA “P 


The Laplace transform of the error e(t) defined by Eq. (3.11), if denoted 
by £(s), is then 
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eee eae) a E ee ne ae 

Y(s) = Y(s) a F,(s) l = K-G (s) [1 KoG'(s)] (4.6) 
For simple feedback servomechanisms, as shown in Fig. 4.8, the transfer 
function of the feedback link F(s) is simply unity; i.e., the output is only 


measured, not modified for the feedback control. Then Eqs. (4.6) and 
(4.7) simplify to 


Y(s) _ — Kas) | 1 
¥@) 7 “Ti Ke@ ~ TEGO +1 (4.7) 
and He 
8 
Fe) ~ KGS) or 


The first requirement of a servomechanism is stability. This means 
that the output y(t) should be damped, except possibly for steady sinus- 
oidal motion. Our analysis in Sec. 2.4 shows, however, that the condition 
of stability is mathematically equivalent to the statement that F,(s) 
should have no poles in the right-half s plane, where the real part of s 
is positive. For the general feedback servomechanism, as shown by 
Eq. (4.6), the poles of F,(s) are zeros of 

a ee + KiG(s) (4.9) 
Fs) KiGi(s) 1 7" 
For the simple feedback servomechanism, as shown by Eq. (4.8), the 
poles of F;(s) are zeros of 


Ay (4.10) 


Therefore the first design criterion of feedback servomechanisms is 


(a) The function 1/F,.(s), given by Egs. (4.9) and (4.10), should not have 
zeros in the right-half s plane. 


The second requirement of a servomechanism is quick response. If 
s, is a pole of F.(s), then the analysis in Sec. 2.4 shows that the output 
has the component e**. The quickness of response is thus determined 
by the magnitude of s. The larger the magnitude of s,, the shorter the 
time scale and thus the quicker the response. Therefore the second 
design criterion of a feedback servomechanism is 
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(b) The zeros of 1/F,(s), given by Egs. (4.9) and (4.10), should all be of 
large magnitude and lie sufficiently to the left of the imaginary axis 
of the s plane. 


If the feedback servomechanism is designed for controlling the output 
to follow the input signal, the steady-state output after the removal 
of transients should be made as close as possible to the input. Therefore, 
for such “positional” controls, there is the third requirement that the 
ratio E(0)/¥(0) between the steady-state error and the steady-state 
output should be as small as possible. This condition can be translated 
into a condition on the gains of the transfer functions by using Eqs. (4.6) 
and (4.8). Thus 


(c) For positional control, accuracy of control requires for general servo- 
mechanisms, Eq. (4.7), 


Ses AsO (4.11) 


and for simple servomechanisms, Eq. (4.9), 
K>1 (4.12) 


The conditions (a), (b), and (c) are the design criteria of feedback 
servomechanisms. In practice, it is usually difficult to satisfy conditions 
(b) and (c) as fully as desired, and a compromise has generally been made. 
We shall see this in the following sections. 

4.3 Method of Nyquist. Since, as stated before, the transfer func- 
tions are usually ratios of two polynomials in s, criterion (a) of the last 
section is generally equivalent to specifying the nonexistence of roots 
with positive real parts for a polynomial. This is a classic question and 
is answered by E. J. Routh using the so-called Routh inequalities, involv- 
ing the coefficients of the polynomial under investigation. This method, 
however, isnot favored by control engineers, because of the obscure manner 
of the variation of the Routh inequalities with changes in the coefficients. 
Engineers prefer a method of analysis which uses the transfer functions 
written in Eqs. (4.9) and (4.10) directly without further modification; 
because these transfer functions are the immediate information possessed 
and are understood “physically” by the engineer. | 

Such a method was devised by H. Nyquist. The Nyquist method is 
based upon a theorem due to Cauchy for an analytical function f(s), where — 
sis a complex variable:! | 

If f(s) has n zeros and m poles within a closed path C, then as s travels 


See for instance Whittaker and Watson, “Modern Analysis,” Sec. 6.31, p. 119, 
Cambridge-Macmillan, 1943. 
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along C once in a clockwise direction, the vector f(s) carries out n — m clock- 
wise revolutions about the origin. 

To apply this very powerful theorem to our problem, we have chosen 
the path C to enclose the whole right-half s plane, where zeros with 
positive real parts would lie. Such a path is shown in Fig. 4.4 and con- 
sists of the imaginary axis and a semicircle to the right with the radius 
R— œ. Take first the simpler case, the case of a simple feedback 
servomechanism. We note from Eq. (4.10) that the poles of 1/F;(s) 
are zeros of G(s). Let the number of zeros of 
G(s) in the right-half s plane be m. Then 
1/F,(s) has m poles in C. Therefore, in order 
for 1/F,(s) to have no zeros in the right-half s 
plane, 1/F,(s) has to carry out m counterclock- 
wise revolutions around the origin when s de- 
scribes the contour C of Fig. 4.4 with k — œ. 
But from Eq. (4.10), it is easily seen that this is 
equivalent to requiring 1/KG(s) to carry out m 
counterclockwise revolutions around the point 
—1. Butsince K isa constant, the above criterion is the same as requiring 
1/G(s) to carry out m counterclockwise revolutions around the point — K. 
Needless to say, when G(s) has no zero in the right-half s plane, or m = 0, 
then the Nyquist stability criterion requires the vector 1/G(s) to make no 
revolution around the point —K. 

Let us illustrate the application of the method by taking the simple 
transfer function 


s - plane 


Fie. 4.4 


GO 
s(1 + ris) (1 + ras) (4.13) 
Then 1/G(s) = s(1 + ris) (1 + res) 


First consider the part of the path C of Fig. 4.4 along the imaginary axis, 
where 
l 
G (iw) 


At w = 0, 1/G(liw) = i0. As w —> +œ, 1/Gliw)— —iœæ. Therefore as 
w increases from 0 to », the vector 1/G(tw) increases in magnitude, and 
its phase angle increases from 7/2 to 87/2. For negative w, the curve 
traced by the end point of the vector 1/G(iw) is simply the reflection 
of the curve for positive w about the real axis, as required by Eq. (3.17). 
Thus as s traces the imaginary axis, 1/G(iw) traces the curve aboc shown 
in Fig. 4.5. 

As s traces the large semicircle shown in Fig. 4.4, 1/G(s) ~ s*. Then as 
s rotates from tœ to —1 clockwise, 1/G(s) will also rotate clockwise, 
but three times as fast. This part of 1/G(s) is thus represented by the 


= tw(1 + try) (1 + irw) 
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curve ¢ to a in Fig. 4.5. From this figure, it is seen that if K = Ky as 
indicated in the figure, then the vector 1/G(s) will make no net revolutions 
around the point —K;. Since the function G(s), given by Eq. (4.18), 
has no zero, this means that the feedback system will be stable. If 
K = Ky as indicated, then the vector 1/G(s) will make two net clock- 
wise revolutions around the point ~Ky. Therefore, with this larger 
value of K, the feedback servomechanism will be unstable. In fact, there 
will be two poles of F,(s) with positive real parts. The transition point 
from stability to instability is the point b. Stable values of the gain K 
must lie between the origin and this point. 
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For a general feedback servomechanism, the question is whether there 
is any zero of the expression 1/F,(s) given by Eq. (4.9) in the right-half 
s plane. To use this expression directly for the Cauchy theorem is incon- 
venient, because then we have to add two vectors 1/KyG;(s) and K.G42(s). 
Now let Gi(s) and Go(s) have mı and me zeros, respectively, in the right- 
half s plane. The respective numbers of poles in the right-half s plane 
are nı and ns. Then it is evident that the number of poles of 1/F,(s) 
in the right-half s plane is m, + ns Now let us divide 1/F,(s) by 
K.G2(s). This operation will introduce m: poles and n: zeros into the 
expression. But there is a possibility that some of the zeros may be the 
same as the poles, and thus both are removed. Let the number of zeros 
and poles thus removed bea. Now 


es 1 , 
Ee LA N 


The number of poles of 1/F,(s)K2G2(s) in the right-half s plane is the 
number of poles of 1/K,K2G1(s)@2(s) and is thus equal to mı + mo. Now 


(4.14) 
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let us assume that there is no zero of 1/F,(s) in the right-half s plane, 
1.e., the feedback system is stable. Then the numbers of zeros and poles 
of various expressions are as listed in Table 4.1. It can then be easily 
deduced that n: — a = 0, and 1/F;(s)K2G2(s) also has no zeros in the 


TABLE 4.1 
Number in right-half s plane of 
Expression 

Zeros Poles 
G(s) my Ny 
G(s) Me Ne 
1/F,(s) 0 Mı + Ne 
1/F5(s)K2G2(s) Ne —a@ | M+ Ne + M2 —-— a = m + Me 


right-half s plane. Hence as s traces the path C specified in Fig. 4.4, 
the vector 1/F,(s)KeG2(s) should make — (mı + m2) clockwise revolutions 
around the origin. By referring to Eq. (4.14), it is seen that this con- 
dition of stability is equivalent to requiring the vector 1/K1K2Gi(s)G2(s) 
to make mı + m: counterclockwise revolutions around the point —1. Or 
we may require the vector 1/Gi(s)@2(s) to make mı + m counterclock- 
wise revolutions around the point — (KK). This is the Nyquist cri- 
terion for stability of a general feedback servomechanism. 

The essential part of the locus of the path of integration in the Nyquist 
method is the part where s = ww, as is clearly demonstrated by our 
example in Fig. 4.5. Therefore the stability problem can be solved by 
using directly the data on the frequency response of the forward link 
and the feedback link. Since the frequency response of the elements in 
the system is often determined experimentally, a method allowing the 
direct application of experimental information has advantages. This is 
the merit of the Nyquist method. Its drawback is the uncertainty 
about the degree of stability. That is, if stable, what is the magnitude 
of damping? To answer this question, we may modify the criterion to 
require no zeros of 1/F,(s) to the right of a line parallel to the imaginary 
axis in the s plane but displaced to the left. The distance —\ between 
this line and the imaginary axis specifies the minimum amount of damp- 
ing. The Nyquist criterion can again be used, with the proper modifica- 
tions on the path C, for s and the numbers of zeros m, mi, and ma. How- 
ever, to carry out this test, we have to know the value of the transfer 
functions, not at s = tw, butats = —A + iw. Hence information on the 
frequency response can no longer be used directly. Then the method 
of Nyquist loses its main advantage. In fact, a different approach to 
the question devised by W. R. Evans! is much better. We shall discuss 
this method in the following section. 
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4.4 Method of Evans. Let us consider first the case of simple feed- 
back servomechanisms. Then the basic question is to find the roots of 


the equation 
i 


l 
0 = One 1 + KOO 
with G(s) given. The Evans method determines such roots as functions 
of the gain K and is thus called the root-locus method. When this is done, 
any set of specifications on the roots gives a proper choice of the magni- 
tude of K. This method thus goes much beyond the mere satistaction 
of criterion (a) of Sec. 4.2 and actually solves the design problem for all 
three criteria stated in that section. 
Now let G(s) be specified by its zeros pı, Po » -> , Pm and its poles 
gi, q2, » » - 14m Then from the definition of gain given by Eqs. (3.16), 
(3.21), and (8.23), 


_ , @— p(s — ps) + +: (S — Pn) 
GOL ea)? = @ 18) 


(4.15) 


where 
fei, RON) A 
(=P) (=P) + + (Pn) 
For physical systems, the polynomials in the numerator and the denom- 
inator of G(s) have real coefficients. Then the p’s are either real or 
form complex conjugate pairs. Similarly, the g's are either real or form 
complex conjugate pairs. Therefore A is always real. For engineering 
systems, usually things are so arranged as to make A not only real but 
also positive. Hereafter, then, we shall consider A to be real and 
positive. Generally, the denominator of G(s) is of equal or higher order 
than the numerator, that is, n > m. Let us express each of the factors 


in Eq. (4.16) in vector form: 


s — py = Pye"! 
— (4.17) 
S — Pm = Pre” 
s—q = Qc” 
ee (4.18) 
S — qr = Qe 


The vector P,e'* goes from p, to s. The vector Q,e'* goes from g, to s. 
s is the variable point in the complex s plane. By using Eqs. (4.17) and 
(4.18), G(s) can be written as 


_ A (Pietei) (Pe?) ars (P neien) 


TT] (4.19) 


G(s) 
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Since A is real and positive, we can write Eq. (4.19) as 


G(s) = Res (4.20) 
where 
a GPP yet Pe, 
ane (QiQ2 © > Qn) Ona 
and 


O = (g1 + go + ae + Ym) — (81 + b2 + RN + An) (4.22) 


Since the P’s and Q’s are magnitudes of vectors defined by Eqs. (4.17) 
and (4.18), they are positive. Therefore R is positive. The basic equa- 
tion for the roots of the inverse system transfer function, Eq. (4.15), is 


thus 
e778 


Ee 
Therefore, to satisfy this equation, we must have 
Kk =1 (4.23) 
and 
O = +r (4.24) 


The Evans method consists of two steps: The first step is to determine 
all s’s that satisfy the appropriate angle condition of Eq. (4.24). Then, 
knowing such a root locus, we can compute R and hence K, by Eq. (4.28), 
for each point on the root locus. Evans has developed a number of 
useful rules for plotting the root locus. We shall now explain these 
rules. 

Rule 1. For K =0, Eq. (4.15) shows that G(s) > ©. Thus for 
K = 0, the roots of 1/F,(s) are poles of G(s), or the root locus starts at the 
poles of G(s). These poles of G(s) will be denoted by a dot in the s plane. 

Rule 2. For K— œ, @(s)— 0. Thus for K — œ, the root locus 
could be the zeros of G(s). We shall denote the zeros of G(s) by a small 
circle in the s plane. But if n > m, the number of zeros of G(s) is less 
than the number of zeros of 1/F,(s). However, in that case, G(s) — 0 
as s—> ©. Therefore the missing roots are supplied by s = ©. Fur- 
thermore, for very large s, 


A 


sem 


G(s) ~ 


Therefore Eq. (4.15) can be approximated by 
got = —KA 
Thus the asymptotes of the root locus have the phase angles 


T 2kr 7 
a EA k=l 2,8) 2 @ 5 (4.25) 
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Rule 3. The root locus along the real axis is along alternate seg- 
ments connecting zeros and poles of G(s) located on the real axis, starting 
with the one farthest to the right. 

This rule may be easily verified by considering any point s on the real 
axis. The angles to this point from a pair of complex conjugate zeros 
or poles are +e and —@ or +0 and —6, respectively. Thus their 
sum is zero. The angle to this point from a pole or zero on the axis 
is 0 for all poles or zeros to the left of s, and it is r for all poles or zeros 
to the right of s. Thus, the sum is r if there is an odd number of poles 
and zeros of G(s) to the right of s. 

Rule 4. If there is a breakaway of the root locus from the real axis, 
the point of breakaway may be estimated from the condition that, for a 
small displacement Aw from the axis, the increase in angle due to the 
poles and zeros of G(s) on the axis to the left must be just balanced by the 
effect of those to the right. 

Example: Consider the transfer function 


(0.001) (2) (6) 
(s + 0.001)(s + 2)(s + 6) 


At K = 0, the locus starts from —0.001, —2, and —6 on the real axis. 
Sections of the locus lie between —0.001 and —2, and between —6 and 
—o, Herem = 0, n = 3. Therefore the phase angles for the asymp- 
totes, according to Eq. (4.25), are +r/3, —r/3, andr. Breakaway from 
the real axis occurs at \i, between —0.001 and —2. By applying Rule 4, 
we have 


G(s) = (4.26) 


Aw Aw Aw 
00th 3 oe 


or 
(Ar + 2)(A1 + 6) + (Ar + 0.001) (Ar + 6) + (Ar + 0.001) (A1 + 2) = 0 


Hence 
3A? + 16.002A; + 12.008 = 0 


Therefore 


2 
oes 16.002 _ | 16.002 T 12.008 = —0.904 
6 6 3 


Rule 5. The point at which the root locus crosses the imaginary axis 
into the right-half s plane can often be estimated by taking advantage 
of the properties of the right angle. 

Example: Let us consider the same transfer function, of Eq. (4.26). 
Away from the origin, it can be very closely approximated by 


0.001) (2) 6) 
CO) Fe Oya 6) 
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Then, as shown in Fig. 4.6, 6, = 7/2. Therefore Eq. (4.24) gives 
Se ee | 

or 

a 


+s 5 


But, by referring to the figure, 


= 9, +6 and 


AIN 


T 
4 

hence 
a ~ B 


This is the geometrical condition for determining the crossover point U. 


_ (0.001) (2) (6) 
G(s)= G01) 12 GTE) 
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Rule 6. The direction of locus departure from a pole (or locus approach 
to a zero) may be easily estimated by computing the angle at the pole 
(or zero) under consideration from all the other poles and zeros in the 
field. 

Example: Figure 4.7 shows the root locus for a transfer function G(s) 
having two zeros and two poles on the real axis, and a pair of complex 
conjugate poles. For small displacements away from the pole g4, the 
angles 01, ¢2, 61, 62, and 63 from other zeros and poles remain constant. 
Thus the angle 6, is given, according to Eq. (4.24), by 


(pı + g2) — [(01 + 82 + 83) + O84] = 7 


This equation determines 64. 
These rules give the essential characteristics of the root locus. For 
intermediate locations, the root locus is found by taking a number of 
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trial points. The gain K can then be calculated along the path of the 
root locus. When the desired location of the roots of 1/F,(s) is finally 
chosen, the corresponding value of K can thus be fixed. The design 
of the feedback system is then completed. 


Fig. 4.7 


4.5 Hydrodynamic Analogy of Root Locus. By combining Eqs. (4.15) 
and (4.16), we have 


(s = g) ls = g) °° S—G) _ _ 
S= p s = pa ee p) Ea 


If we take the logarithm of the above equation and then divide the result- 
ant equation by 2r, we have 


l 1 1 f 
W(s) = x », oe (s = 4) = 5 X log (s — pi) = 5 log KA +i) 
qn] j=l 
(4.27) 


Equation (4.27) has many possible physical interpretations. A very 
illuminating one is to consider W (s) as the complex potential function of a 
two-dimensional irrotational flow of a perfectly incompressible fluid.! 
If (\,a) and W(A,w) are the potential function and the stream function, 


respectively, then 
W(s) = o,a) + Yw) (4.28) 


with s = à + iw. Therefore Eq. (4.27) for the root locus of 1/F,(s) 
can be interpreted as lines on which the stream function y assumes the 
constant value 4. In the terminology of fluid mechanics, the root locus 


1 See for instance V. L. Streeter, “Fluid Dynamics,” McGraw-Hill Book Company, 
Inc., New York, 1948. 
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is thus composed of branches of the + streamline. The potential func- 
tion along the streamline changes from point to point and is equal to 


] : 

~ log KA 
Equation (4.27) also shows that the flow is composed of n sources of 
unit strength located at the points gi, gz, . . . , Qn and of m sinks, also 
of unit strength, located at the points Pı, pe, . . - , Dm. In our graphical 


representation, sources are indicated by a dot, and sinks by a small 
circle. With this interpretation, the pattern of root loci in Figs. 4.6 and 
4,7 can be immediately “understood.” 

The hydrodynamic analogy is also very useful in suggesting modifica- 
tions of the system to achieve a better feedback performance. For 
instance, a system characterized by the transfer function 


= Giga 
G(s) = ce) gil < la 

may have the disadvantage of being unstable in closed-cycle perform- 
ance at too low values of the gain K, and thus not being able to satisfy 
criterion (c) of Sec. 4.2. The root locus is similar to that shown in Fig. 
4.6. The hydrodynamic analogy immediately suggests that the cross- 
over point U can be moved up by pulling that part of the streamline near 
U to the left, with a sink p, close to qi, and a source ge near to ge. Thus 
the modified transfer function is 


G( ) = de (s a Pe) q1g2 
Pe (8 — Ge) S(s — Q1) (8 — 42) 
The corresponding root locus is shown in Fig. 4.8. Since |p.) < |ql, 
the additional transfer function put in series with the original transfer 
function must be that of a lead network, as shown in Sec. 3.3 by Eq. 
(8.27). 

The hydrodynamic analogy also permits us to understand the possi- 
bility of speeding up the response of a slow mechanism by using the 
feedback link. According to criterion (b) of Sec. 4.2, fast response 
requires roots with large magnitudes. Now, for simplicity, suppose we 
have a linear mechanical system of first order characterized by a small 
qı on the negative part of the real axis. If we put this system in series 
with a fast-damped electric network characterized by a large q: on the 
negative real axis, the response will not be improved, because we still 
have the small qı root. But if we have closed the feedback cycle, then 
the streamline pattern, or the root locus, indicates that the smaller root 
qı will increase with increasing gain K toward the larger root go. There- 
fore, with proper choice of the gain K, we can make the roots much larger 
than qı and thus greatly increase the rapidity of response of the system. 
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The technique of plotting the root locus can also be applied to general 
feedback servomechanisms. There the problem is to plot the root locus 
of 1/F,(s) given by Eq. (4.19). Thus the condition for the root locus is 


D B 
K Gls) = K2G2(s) 


Since the roots of 1/F,(s) are different from the zeros of G2(s), we can 


A 


Modified 
one Original 


Fig. 4.8 
divide the above equation by Go(s)/Ki, and thus 
l 
AOAO = — K-K: (4.29) 
Therefore, if we put 
G(s) = Gs(s)Ga(s) E 
K = KiK: ( 9 ) 


and then compare Eq. (4.29) with Eq. (4.15), we see that the problem 
of finding the root locus of the general feedback servomechanism is 
reduced to that of the simple feedback servomechanism discussed previ- 
ously. In fact, our careful analysis of the application of the Nyquist 
method to general feedback servomechanisms in Sec. 4.3 shows that the 
method of reduction given by Eq. (4.30) can also be used there. There- 
fore, as far as finding the qualitative performance specified by criteria (a), 
(b), and (c) of Sec. 4.2 is concerned, there is no difference between the 
simple feedback servomechanism and the general feedback servomech- 
anism, if the relations of Eq. (4.30) are borne in mind. Only when the 
quantitative performance of the system is required must the differences 
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in the system transfer functions F,(s) as shown by Eqs. (4.3) and (4.7) 
be properly recognized. 

4.6 Method of Bode. At the point U where the root locus crosses 
over to the right-half s plane, the root is by definition purely imaginary, 
say 1w*. In other words, Eq. (4.15) is satisfied by s = iw*, or 

KG(tw*) = F(iw*) = —1 = le~ 

Therefore, the critical condition of transition from stability to instability 
occurs if the amplitude M of the frequency response is equal to unity 
and, simultaneously, the phase angle @ of the frequency response is 
equal to —r. This critical condition can also be deduced from the 
Nyquist criterion, which specifies the critical point as — 1 in the 1/F (iw) 
diagram. In fact, by studying a typical example, such as Fig. 4.5, it 
will be seen that for stability the 1/F (iw) curve must encircle the —1 
point. Since the magnitude of 1/F (iw) generally increases with increas- 
ing w, encirclement can be ensured by requiring that the magnitude of 
1/F (iw) be larger than 1 when the phase angle of 1/F (tw) is equal to r. 
This is equivalent to saying that M should be less than one when @ is 
equal to —r. Or, we say that 8 should be larger than —r when M = 1. 
This condition for stability is the basis of the method of Bode: the 
frequency at which the amplitude M of the frequency response is equal 
to 1 is called the point of gain crossover. The difference of 0 and —r is 
called the phase margin. The Bode criterion for stability is thus stated 
as a phase margin of 30 to 50 degrees at gain crossover. In a Bode dia- 
gram, the point of gain crossover is the frequency for logo M = 0, and 
the Bode criterion can be easily tested. 

The Bode method is similar to the Nyquist method in that the informa- 
tion on the frequency response can be used directly. This advantage of 
simplicity is counterbalanced by the disadvantage, in comparison with 
the Evans root-locus method, of not being able to know the degree of 
stability. R. M. Osborn! tried to remedy this situation by giving a 
semiempirical rule to calculate the damping coefficient ¢ for the most 


critical root. His formula is 
| a 


= m 
where a is the phase margin in degrees at gain crossover, and m is the 
slope of log M against w at the gain crossover. The unit of time for ¢ 
is the same as that for w. Thus, if a = 30 degrees and m = 1.7, then 
¢ = 1/2 X 1.7) = 0.8. 

4.7 Designing the Transfer Function. The various methods dis- 
cussed in the previous sections for determining the stability of feedback 


(4.31) 


1 R. M. Osborn, paper presented at Summer Meeting, IRE, San Francisco, August. 
1949. 
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servomechanisms are mainly methods of analysis. They are partly 
methods of synthesis, i.e., methods of designing the transfer function, but 
only in so far as they fix the range of possible values of the gain K. Of 
course, both methods can suggest changes in the structure of the transfer 
function to improve performance. This is particularly so in the case 
of the root-locus method. However, how to realize these desired changes 
in the transfer function by modifying the physical elements of the system 
is mainly an art in servomechanism engineering practice. 

Only in one aspect of this synthesis problem is a general solution 
known. This is the problem of designing an electric circuit composed of 


Compen- 
sating 
Network 


| Compensating 7 
Amplifier 


ae 
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resistances and capacitances, an RC circuit, such that the transfer func- 
tion of this electric circuit has the specified zeros and poles. Since such a 
circuit has great flexibility and is used very often to “compensate”’ the 
transfer-function characteristics of other elements in the system, and 
since the desired modifications of the transfer function can indeed often 
be put in terms of additional zeros and poles, the general solution of such — 
a problem is very important. Important contributions to this problem 
were made by E. A. Guillemin! and L. Weinberg.? We shall not pursue 
the subject here, but only emphasize the possibility of synthesizing an 
RC circuit of very complex specified properties. 

4.8 Multiple-loop Servomechanisms. The servomechanisms dis- 
cussed thus far are single-loop servomechanisms. Engineering practice 
often calls for much more complicated systems. For instance, Fig. 4.9 
is the block diagram of a typical control system? for an airplane rotating 


1 E. A. Guillemin, J. Math. and Phys., 28, 22-44 (1949). 
*L. Weinberg, J. Appl. Phys., 24, 207-216 (1953). 
3L. Becker, Aeronaut. Eng. Rev., September, 1951, p. 17. 
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about a single axis. The inner loop is the so-called control surface- 
position feedback or “follow-up.” If the inner loop is not closed, we 
have the usual feedback control, and 


Y(s) F(s)F2(s) (4.32) 


ewe 
erent eee 


(s) 1+ Fi(s)F2(s)F2(s) 
If both loops are closed, then 
A(s) = Fi(s)[X(s) — BA(s) — F(s) Y (s)] 


and 
Y(s) = Fa(s)A(8) 
Therefore 
A(s) _ F(s) 
¥G@) ~ [Pah +O ee) 
and 


(s) Fals) Pa) Ton 


X(s) 1+ BFi(s) + Fils)Fo(s)F3(s) 


The stability and response of the control system then depend upon the 
zeros of the expression 1 + BFi(s) + F1(s)F2(s)F3(s). 


Open Cycle 
Controller 


Unalterable 
Elements 
F(s) 


Series Loop 
Controller 
F(s) 


Controller 
F(s) 


Fig. 4.10 


One of the difficulties of designing a good control system is to have 
accurate control and hence large gain K together with fast response and 
satisfactory damping. This led to the idea of combining the closed- 
cycle control with open-cycle control, proposed by J. R. Moore.! Con- 
sider the system represented by Fig. 4.10, where the closed-cycle control 
and the open-cycle control are put in parallel. We have thus 


Fi(s)X(s) = Yi(s) (4.35) 
and Y(s) = Fa(s){¥i(s) + Fi(s)[X(s) — Fs(s) Y(s)]} ) 


1J. R. Moore, Proc. IRE, 39, 1421-1432 (1951). 
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Solving for the output Y(s), we have 


Y(s) _ Pa(s)Fa(s) + Fi(s)F:(s) 
X (s) 1 + F(s)Fa(s)F;(8) 


The stability and the speed of response of the system are thus established 
by the zeros of 1 + Fı(s)F:(s)F;(s). Since F(s) is fixed, the design 
problem is to find the proper transfer functions F,(s) and F;(s). The 
actual response, in particular the steady-state error, is dependent upon 
the additional transfer function F,(s) of the open-cycle controller. 
Therefore the feedback loop is designed primarily for stability and 
dynamic response, while the steady-state or “synchronizing” operations 
are largely taken care of by the open-cycle portion of the system. 

When there are many variables to be controlled simultaneously and 
when these controlled variables are also coupled, as in a steam power 
plant, then the system diagram has many loops with a complicated 
feedback scheme.! An extreme example of such complex systems is, 
perhaps, the automatic control and guidance system for airplanes.? The 
analysis of such a system, although following the same principles as 
explained in this chapter for simple servomechanisms, can hardly be 
done without recourse to analog computers. But this is only engineer- 
ing development work: the process of going from principles to practice. 


(4.36) 


1 See for instance J. Hanny, “Regelung Theorie,” A. G. Gebr. Leemann Co., 
Zürich, 1946. 
2J. B. Rea, Aeronaut. Eng. Rev., November, 1951, p. 39. 


CHAPTER 5 


NONINTERACTING CONTROLS 


For complex systems with several controlled quantities and with inter- 
action between these controlled quantities, a new design criterion gen- 
erally has to be introduced. This is the criterion of noninteraction. 
For instance, the variables of a turbojet engine with afterburning are 
the engine speed, the fuel injection rate to the combustion chambers, the 
fuel injection rate to the afterburner, and the cross-sectional area of 
the tail-pipe opening. However, the operation of this engine may be 
based upon specific settings of the speed, the fuel rate to the combustion 
chambers, and the fuel rate to the afterburner. If this is the case, it is 
obvious that one of the design criteria for the servocontrol of the system 
is the independence of the three different control settings: a change in 
fuel rate to the afterburner should not change the engine speed, and a 
change in engine speed should not require a change in fuel rate to the 
combustion chambers. The key to this particular design problem is 
then the proper manipulation of the tail-pipe opening with respect to the 
other variables and the proper design of the control servos. The purpose 
of this chapter is to give a general method for designing such noninter- 
acting controls for systems of arbitrary complexity. This general method 
was first given by A. 8. Boksenbom and R. Hood.! 

5.1 Control of a Single-variable System. Let us consider first a sim- 
ple system with one controlled output y(t) and one control setting, or 
input, x(t). Their Laplace transforms are Y(s) and X(s). Consider the 
control designed according to Fig. 5.1. ÆE(s) is the “engine” transfer 
function, L(s) is the instrument transfer function, S(s) is the servo 
transfer function, and C(s) is the “control” transfer function. Only 
C(s) can be changed easily by the designer. The system is slightly 
different from the simple servomechanism of Fig. 4.2, in that an arbitrary 
disturbance V(s) is introduced between the servo and the engine to 
account for accidental outside influences. 

The relation between input W (s) to the engine and the output Y(s) is 


` Y(s) = E(s)W(s) = E(s)[S(s)U(s) + V(s)] (5.1) 
U(s) is the output of the control transfer function and is in turn given by 
U(s) = C(s)[X(s) — Z(s)] = C(s)[X(s) — L(s) ¥(8)] (5.2) 


1A. S. Boksenbom and R. Hood, NACA TR 980 (1950). 
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By eliminating U (s) from Eqs. (5.1) and (5.2), we have 


| ESC) B(s) 
YO = FOROL T O + ECF O O3 


This is the equation for the Laplace transform of the output under 
appropriate initial conditions for y(t) and z(t). Except for the second 
term, involving the disturbance V(s), Eq. (5.3) is the same as the previ- 
ous relation, Eq. (4.3), for simple servomechanisms. The analysis of 
the performance of the system can also be carried out in a similar way. 
However, for more complicated systems, this scheme has to be general- 
ized. We shall do this presently. 


Instrument 


Fra. 5.1 


5.2 Control of a Many-variable System. Let the number of outputs 
Yi(s), Yo(s), ..., Y,(s), . . - , Yi(s) of the engine be 2 and the num- 
ber of inputs W1i(s), We(s), . . . , Wils), . . . , Wa(s) ben. Then the 
generalization of Eq. (5.1) is 


Yi(s) = E (s) Wi(s) + Es) W (8) + Pra 4- Exn(s) W2(s) 
Y2(s) = Eal) Wals) + E8) W3(s) + ye 4- Fan(s) Wals) (5.4) 


e @ ə òè è oò č »è» òè ò ç a ò ò o @# č > č ò e# ç è ç a ò ç a ò â a ç a # #& ç O ò 8 ¢ ç & â a ç b çğë ‘œ 


Y;(s) = Eal) Wils) + Eil) Wels) + + + + + Ein(s) Wals) 


Each Ez(s) is the transfer function which, when operated on the input 
W,(s), gives a component of the output Y;(s). E;(s) is then generally 
a ratio of two polynomials of s, either obtained theoretically from analyz- 
ing the engine characteristics, or determined experimentally through the 
frequency response. Equation (5.4) can be compressed into 


Y,(s) = X Bal) Wals) (5.5) 


k=1 


The array of the quantities E,,(s) can be conveniently called the engine 
transter-function matrix E. We may then consider that the inputs 
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W,(s) “enter” the matrix as columns and that the outputs Y,(s) “leave” 
the matrix as rows, as indicated in Fig. 5.2. We shall be concerned 
with the cases where the number of inputs is greater than the number of 
outputs, thatis,n > i. Therefore the matrix Æ is rectangular with more 
columns than rows. For later use, a square matrix obtained by using 
only the first îi columns is denoted by E*. 

Since the number of inputs to the engine is greater than the number of 
outputs, the system behavior is not determined by merely giving the 
settings X,(s), where j = 1, ... , 2, for the outputs Y,(s), but in addi- 
tion the settings =, (s) for the variables W,(s), where u =7+1,...,7, 
must also be specified. The controlled quantities are then the outputs 
Y,(s), for vy =1,..., 4%, and the n — i engine inputs W,(s). If the 


w We W; W, 


measured values of W,(s) after the instrument are denoted by T,(s), 
then the errors are £,(s) — T,(s). The errors of the engine output are 
defined as the differences X,(s) — Z,(s), where Z,(s) for v = 1, , 1, 
are the measured values of the output after the instrument, as shown in 
Fig. 5.1. The function of the control is to take these errors as inputs and 
to generate correction signals U;(s) for the servos. This is the feedback 
link. In the present generalized control system, the correction signals 
U,(s) are made to depend linearly upon all errors. Since there are n 
error signals, there are n correction signals; k thus ranges from 1 to n. 
Thus 


Ui(s) = Ciy(Xy — Z) + Cie(Xe — Ze) Fos + Cali — Za 

+ Chere — Tort) ft + Cina — Ta) 
Uls) = Cai(X1 — Zd) + CalX: — Ze) to + + Co(X; — Z 

+ Cinan — Tos) to + Cyn — Ta) | (5.6) 
U,(s) = Cri(X1 — Zi) + Cno(X2 — Zo) +: + Cnil X: — Zi) 

+ Ch Ben — Ta) bee + Ci (an — Tn) 
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where we have separated the control matrix into C and C” to indicate 
that two kinds of error signals are involved. Equation (5.6) can be 
compressed into 


: Se at 
Ul) = Y Col(X-Z)+ Y Ce - T) poy n GD 
p=] u=iti 


Each of C;, and C% is, of course, a ratio of two polynomials in s. Equa- 
tions (5.6) and (5.7) can also be represented graphically as shown in 
Fig. 5.3. 


— n en ae aa a a -e ee ee ee oe ee 


(X,—Z,) (X2~Zo) (X;~-Z;) (Emha) (En -Ta) 
$$$ m 
C c’ 
Fie. 5.3 


The measured values of Z,(s) and T,(s) are related to Y,(s) and W,(s) 
by the transfer functions L,,(s) and L,,(s) of the instruments: 


Z,(s) = L,,(s)Y,(s) (5.8) 
T,.(8) = La(s) W,(s) (5.9) 


The correction signals will act individually on the servos, and the 
outputs of the servo when combined with the accidental outside dis- 
turbances V;(s) give the inputs W,(s) to the engine. If S,(s) are the 
transfer functions of the servos, then 


Wils) = Sels) U;(s) + V;(s) k = Ll 2, oe eo yt (5.10) 


Equations (5.4) to (5.10) completely describe the control system with 
many variables. Figure 5.4 is a block diagram for a system with three 
engine outputs Yi(s), Yo(s), and Y3(s) and two controlled inputs W,(s) 
and Ws(s). The entire system is enclosed except for the settings and 
the outside disturbances, which can be imposed on the system. 
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By eliminating U;(s), Z,(s), and T,(s) from the previous system of 
equations, we have 


n 4 


iis} = Y {F BaO Sal) = Lal) Y) 
k=1 yvy=1 
T > Erl) Sux(8)Ciy(8)[Eu(s) — Lels) Wu(s)] + EnV (3) (5.11) 
and a 
Wels) = Y Sal) CNX) = LOY] 
v=] 


n 


+) SOCO) — LopWy] + Ve (8.12) 


p=itl 


Equations (5.11) and (5.12) suggest a more compact block diagram for 
the system than Fig. 5.4. This, as shown in Fig. 5.5, involves a single 


Yi, V V Ve 
ewe Servos U Control s V% 


Cir Cy Cia Cig Cis 
C21 C22 C23 Co4 C35 
C31 Caz Caz C34 C3s 
Car Caz Caz Cay Cis 


Ey, Eyz Eis Eu Es 
Ez; E22 E23 Ex, Eas 


E31 E32 E33 E34 Ess 
Engine a tele eg 
ee rr 


X, X As =, Es 
Settings 
Frc. 5.4 
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systems matrix whose inputs are the error signals of the controlled 
variables and whose outputs are the controlled variables. The ESC 
matrix in Fig. 5.5 is a matrix in which the element in the jth row and 
yth column is EySuCr,. Similarly, for the ESC’ matrix, the element 
in the jth row and uth column is EySC},. Similarly, the elements 
of the SC matrix are S:,C,, and the elements of the SC’ matrix are S:iC;,,. 
The outside disturbances V; are introduced through another matrix com- 
posed mainly of the engine matrix &. 

5.3 Noninteraction Conditions. The criterion of noninteraction of 
controls can now be formulated in concrete terms. The problem is to 


X, X2 X; E, Bs 


System Matrix 


Fra. 5.5 


determine conditions on the elements of the control matrix C;,(s) and 

‘u(s) Such that the settings X;(s) and 3,(s) will affect only their respec- 
tive corresponding variables Y;(s) and W,(s), where j = 1, 2,...,4 
and w=7i+1,..., n, and nothing else. Thus, for example, the 
setting X2(s) will modify only Y2(s), while the setting 2:1:(s) will modify 
only W,1(s). The mathematical problem is thus one of “ diagonalizing”’ 
the system matrix of Fig. 5.5. We put the design condition on the con- 
trol matrix, because this is the part of the whole system most easily 
modified by the designer. The characteristics of the “engine,” the 
servos, and the instruments are considered to be fixed and not at the 
disposal of the control engineer. 

Let us study first a specific output Y,(s), with g assuming any one 
of the possible values 1, 2,...,%. Equations (5.11) and (5.12) can be 
written then as 
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Y;(s) = ) | y Ey SinCr(X, — L,Y) 


k=1 »=1,»=*9 
n 


-+ ) Ep SteC y(n — Lu Wa) + EnV | 


w=t+l 
n 


+ y EypSiCrg(Xg — LuYo) 


= 
ih 
oy 


Wals) = J Sikl X» B LY») +- > Ork iy (Ey rag Luu W 4) + V: 
r=], v £g p=i+ 1 
+ SuxCrg(Xo — LoY o) 
Now in order that a setting X, will not influence any Y; or W, except 
Y,, the last terms of the above two equations must be zero for j Æ g and 


for k >i. Therefore for any g among the set 1, 2,... ,2, 
ES trC ko =0 for] xg (5.13) 
t=1 
and 
Cry =0 fork >1 (5.14) 


Equation (5.14) gives an immediate simplification of our control 
matrix. For instance, the example of Fig. 5.4 corresponds to 2 = 3, 
n= 5. Then Eq. (5.14) specifies that 

Cu = Cy = C43 = Csi = Cs. = Css = 0 
Equation (5.14) can also be used to simplify Eq. (5.18); it is in fact 
equivalent to 


` VENT == i ÒE gek eg (5.15) 
k=l k=1 
where g is any among the set 1, 2,..., dand ô; is the Kronecker 
delta, t.e., 
i,=0 jx 4 
5.16 
wet ge en 
For any specific g, Eq. (5.15) is essentially a system of 7 — 1 linear 
algebraic equations for 1 unknowns SCrg, where k = 1, 2,..., i. 


Therefore we can determine only the ratios of these unknowns but not 
their absolute values. This is exactly what is desired, as we do not wish 
to fix the control transfer function absolutely and thus lose freedom of 
design. 

To find these ratios of the control transfer functions, we shall utilize 
a property of determinants: Let |Ež| be the cofactor of the Æp element 
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in the determinant |E*| formed out of the square matrix #*, then 


> Epl =0 kel 
and (5.17) 
} EplEš| = |E*| k=l 


Multiplying Eq. (5.15) by |Ež| and summing over j, we obtain 


) YE *|SoB Sues = 5 ME jE w 


k=1;/=1 k=l j=l 


Therefore, because of Eq. (5.17), we have 


SuCy = |E* y EySiCry /|E*| 1=1,2,...,i (5.18) 
k=1 
In particular, 


Boa ae [EF > EySuCrs / |E* 
k=1 


Then by taking the ratio of Eq. (5.18) to the above equation, we can write 
yi _ [ER l 
Soy. E ee 
This equation gives the off-diagonal elements of the matrix SC in terms 
of the diagonal elements. 

The conditions of Eqs. (5.14) and (5.19) are then the necessary con- 
ditions for noninteraction of the controlled variables Y, They were 
given by Boksenbom and Hood. The same authors proved that these 
conditions are also the sufficient conditions for noninteraction. There- 
fore the problem of finding the appropriate control matrix C is completely 
solved. 

To solve the problem for the other part of the control matrix C’, we 
have to consider the noninteraction conditions for the controlled variables 
W,, wheren =i+1,...,n. For this purpose, we rewrite Eqs. (5.11) 
and (5.12) as 


=1,2,...,% (5.19) 


Y,(s) = » ») R A © eon E ge 


=] y=] 
n 


+ > EnSuC yy Ep 7 Luu W 4) + EV | 
pail 
hr 


a D ESuxC,(Z, ~ LW.) (5.20) 
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and 
Wels) = Y SuCu(X—Ln¥,) + Y Salih, — LW) + Ve - 
p=] p=itl 
usr 
+ SiC. — LerW,) (5.21) 
whererisany among the seti +1,...,nandj=1,2,...,72 Forthe 


present purpose, the k index in Eq. (5.21) is any among theseti +1, ..., 
n, because only these W;’s are the controlled variables. Itis evident from 
Eqs. (5.20) and (5.21) that in order for the setting Z, to influence only 
the variable W,, the last terms in these equation must be zero. That is, 


n 


> ESaC =0 G=1,2...,1 (5.22) 
k=l 
and 
w =O forkr=i+1,...,nandk#r (5.23) 


Again, Eq. (5.23) gives an immediate simplification of the control 
matrix: For the example represented by Fig. 5.4,7 = 3, n = 5, and so 
6 = Coa = 0 


Equation (5.23) can be used also to simplify Eq. (5.22). That equation 
is reduced to 
` ESk = — LSC, 


k=l] 


Multiply both sides of the above equation by |£3| and sum overj. Then 


i > [ES EnS ir = —SrrCt, ) || Ese 
j=l 


k=l j=l 


By using the properties of determinants as given by Eq. (5.17), we have 
B*|SuC, = -8v0 Y, |E}lEs 
j=l 


Therefore we can write the above equation in the following form, by 
replacing | by 7, and j by J, 


SiC}, l 
sich ~ ~ fm), FE, 


[=1 


9 Ay See Gt 
rst lesen 


(5.24) 


This equation then gives the off-diagonal elements of the control matrix 
SC’ in terms of the diagonal elements. Equations (5.23) and (5.24) are 
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the necessary and sufficient conditions of noninteraction for the con- 
trolled variables W,(s), for u =2-+1,...,%. 

For complete noninteraction of all controlled variables, the conditions 
specified by Eqs. (5.14), (5.19), (5.23), and (5.24) must be satisfied. 
The off-diagonal elements of the complete control matrix are either zero 
or expressed in terms of the diagonal elements. When the engine char- 
acteristics as expressed by the engine matrix are known, the diagonal 
elements of the control matrix determine completely the whole control 
matrix. 

5.4 Response Equations. With the noninteraction conditions all 
satisfied, Eqs. (5.11) and (5.12) can be made much simpler. For instance, 
by interchanging the two summations, 

Yl) = ) XO) - LNA] ) BSC 
k=l 


r=] 


n n 


+ YEO = Lul Wo)) Y Eesau + Y EnV: 
u=i-+1 k=1 k=1 

But according to Eqs. (5.13) and (5.14), the sum over k of the first 
term vanishes except when v = j. According to Eq. (5.22), the second 
term vanishes. Thus 


Yi(s) = [Xi(8) — Li(9)¥i(8)]_) BwSuCes + Y Bal 
k=l k=l 
Now SCi can be expressed in terms of the diagonal element S,C, 
according to Eq. (5.19). Thus, using Eq. (5.17), we have 


2 4 


X ESC = e Enl ER] = SC 
Ji ra A 


|E*| 
oes 


k=l 
Therefore, finally, 
E*| 
Y;(s) = EF] SiiCylXi(s) — Ly(s)¥3(s)] + ) EyeVi (5.25) 
Jd k=l 
By using the noninteraction conditions, a similar calculation will reduce 
Eq. (5.12) to 


W,(s) = SpuC uulnus) — Lals) Wals)] + Vis) 
u=i+1l,..., n (6.26) 

By writing 
R, |E* SiC 


iT E BCLs + TEA oe 
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and 


R' = SiGe 
= OC Doi T 1 


the solutions of Eqs. (5.25) and (5.26) can be written as 


(5.28) 


Y;(s) = Ry(s)Xi(s) — [Ral Lals) — 11 ) Bal) Val) (6.29) 
k=1 
and 


W,(s) = Ria(S)E(8) = EMOTE, — 1V) (5.30) 


Equations (5.29) and (5.30) give the relations for calculating the con- 
trolled variables from the settings and the disturbances. They are quite 
similar to Eq. (5.3) for the simple system of one controlled variable. 
The function KRy(s) is the over-all transfer function from input X;(s) 
to output Y;(s). The function #/,,(s) is the over-all transfer function 
from the input Z,(s) to output W,(s). These over-all transfer functions 
are calculated according to Eqs. (5.27) and (5.28) using the character- 
istics of the engine, the servos, the instruments, and the control. In 
fact, the procedure of design will be to determine for each j and u the 
proper control transfer function Cy(s) or C’,(s) for satisfactory perform- 
ance by methods explained in Chap. 4. The nondiagonal elements of 
the control matrix are then determined by Eas. (5.14), (5.19), (5.28), and 
(5.24). When this is done, we have a noninteracting control of good 
performance for a complex, many-variable system. 

5.5 Turbopropeller Control. As a simple example of the general 
theory of noninteracting controls, let us consider the control problem of a 
turbopropeller engine (Fig. 5.6). The variables of the operation of such 
an engine are the speed of rotation, the turbine-inlet temperature, the 
propeller blade angle, and the fuel rate. The control system has to be 
designed for various possible steady-state normal operating conditions. 
For each steady-state operating condition, we have to investigate the 
control performance in nonsteady states near that particular operating 
point. Let W,i(s) be the Laplace transform of the deviation of propeller 
blade angle from the normal point, and W2(s) be the Laplace transform 
of the deviation of fuel rate from the normal value. Then since we are 
interested in nonsteady states near the normal point, the relation between 
the excess of turbine torque over the torque absorbed by the compressor 
and the propeller, and the propeller blade angle and fuel rate can be 
linearized. Therefore the excess torque is represented by a linear combi- 
nation of Wi(s) and W2(s). Let the Laplace transform of the deviation 
of the rotating speed from its normal value be Yi(s). Then the excess 
torque is represented by (1 + 7s) ¥i(s), where 7 is the characteristic time 
constant due to inertia of the rotating components of the power plant, 
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cf. Eq. (4.1). The value of 7 depends upon the normal operating point. 


Thus 
(1 + 7s) ¥i(s) = —aW,(s) + bW2(s) (5.31) 


where a and b are positive real constants, deduced from the engine 
characteristics near the normal operating point. The physical meanings 
of a and b are as follows: If the fuel rate is held at the normal value, 
W2(s) = 0. Equation (5.31) then gives a as —Y1(0)/Wi(0). Buts = 0 
corresponds to the steady state, and therefore a is the ratio of decrease of 
the steady-state engine speed to increase in propeller-blade angle with the 
fuel rate held constant. If the steady-state engine performance is given 
as graphs of engine speed versus propeller blade angle at various con- 
stant fuel rates, then a is the slope of this graph evaluated at the chosen 
normal operating point. Similarly, the meaning of 6 is the slope of the 


Combustion Chamber 


Propeller Compressor Turbine 
Fic. 5.6 


steady-state engine-speed versus fuel-rate curve with constant propeller 
blade angle, evaluated at the chosen normal operating point. Thus the 
constants a and b are specified by the steady-state performance curves 
of the engine. 

For an axial compressor, the mass air flow through the compressor 
for a certain inlet condition is almost constant at a given compressor 
speed. Therefore, with given inlet conditions, the ratio of heat added to 
the gas to the mass of the gas is a function of engine speed and fuel rate. 
Hence the engine speed and fuel rate determine the turbine-inlet temper- 
ature. Let Y2(s) be the Laplace transform of the deviation of turbine- 
inlet temperature from the normal value. Then an equation between 
Y,(s) and Wi(s), with W,(s) similar to that in Eq. (5.31), can be estab- 
lished. However, since the characteristic time for reaching thermal 
equilibrium of the gas is practically zero, the relation is simpler: 


Yo(s) = cWe(s) — eY (s) (5.32) 


where c and e are again positive real constants. In fact, c is the slope 
of the turbine-inlet temperature versus fuel-rate curve at constant engine 
speed, while e is the slope of the turbine-inlet temperature versus engine- 
speed curve at constant fuel rate; all evaluated at the chosen steady-state 
operating point. 
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By solving for Yi(s) and Y2(s) in Eqs. (5.81) and (5.32), we have 


b 
Yi(s) = ra F: s: Wi(s) + EF ——— W2(s) — 
B pho be) + crs l 
Y:(s) = a (s) + rena E W2(s) 


These equations specify the engine matrix E in our theory. It is inter- 
esting to note that there is only one time constant r in the engine matrix. 
Only this time characteristic is intrinsic to the engine. The complete 
control system has, of course, other time constants. But the other time 
constants are introduced by the control functions, the servos, and the 
instruments, and are not in the engine matrix. 

Let us consider first the case of controlling the engine speed and the 
fuel rate. Thus the controlled variables are Yi(s) and We(s). We need, 
then, only the first equation of Eq. (5.33), andi = landn = 2. Hence 
the engine matrix has only two elements: 


—q b 
Er = I +7s Ezg = Por (5.34) 
and 
E*| = |E ]Eu = En Epl = 1 (5.35) 
The control system is represented by 
Uls) = Crr(s)[Xi(s) — Luls) ¥i(s)] + Cie(s)[He(s) — Lee(s)W2(s)] 
(5.36) 
U2(s) = Car(s)[Xi(s) — Lius) ¥i(s)] + Coo(s)[Za(s) — Leo(s)W2(s)] 
The noninteraction conditions require 
Cals) = 0 (5.37) 
and, using Eq. (5.35), 
Sir(s)Cyo(s) _ —|ER| Ei |En EA Er 2 (5.38) 


S22(s)C4,(s) ) C(s) EF u 4a 


Since —a is the partial derivative of engine speed with respect to the 
propeller blade angle, and b is the partial derivative of engine speed 
with respect to the fuel rate, the ratio b/a is the rate of change of the 
propeller blade angle with change in fuel rate at constant engine speed. 
Clearly this ratio is a function of flight conditions of the turbopropeller 
engine. For instance, this ratio b/a increases as the altitude increases. 
Therefore a properly designed control requires means of compensating 
for changes in flight and operating conditions of the engine. 
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The response function R::(s) for the engine speed is then 


B aSir(s)Cir(s) 
Ri(s) T aS 11(s)C13(s)Ln1(s) = (1 + T8) | 

and for the fuel rate (5.89) 
Sox(s)C%,(s) | 


Ral) = BGC (s)Lan(s) F 1 


These equations determine the noninteracting response characteris- 
ties of the engine speed and the fuel rate. The problem is to design 
the control transfer functions C1:(s) and C(s) in such a way that the 
performance is satisfactory for the full range of expected operating 
conditions. 

Now consider the second possibility of turbopropeller control. We 
shall control the engine speed and the turbine-inlet temperature. The 
controlled variables are then Y.(s) and F(s). Thus in this case, we 
need both equations of Eq. (5.33), andi =n = 2. Then the noninter- 
action conditions specify 


SOLO D. 
S11(8)Cu(8) (c — be) + crs 

and Su(s)Cr(s) _ b (5.40) 
Seals \C'o2(s Soo(s)Ca2(s) a 


The response function for the engine speed is then 


Sii(s)Cir(s) 
als) =< Suu) ay 
Sir(s)Cu(s)Lu(s) — en Be L 


and for the turbine-inlet temperature 


_ S22(s) Co2(s) 
S22(8)C22(s)Le0(s) + (1/6) 


5.6 Turbojet Engine with Afterburning. We shall now treat the prob- 
lem of controlling a turbojet engine with afterburning, mentioned at the 
beginning of this chapter. The physical components are sketched in 
Fig. 5.7. We shall again study the problem of control for nonsteady 
states near a chosen normal steady-state operating point. Therefore 
linearization of the relations between the different variables is allowed. 

Let Y,(s) again denote'the Laplace transform of the deviation of 
engine speed from the normal value, W:(s) the Laplace transform of the 
deviation of the tail-pipe opening from the normal value, W2(s) the 
Laplace transform of the deviation of combustion-chamber fuel rate from 
the normal, and, finally, W3(s) the Laplace transform of the deviation of 


Fa2(s) 
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the tail-pipe fuel rate from the normal. Then we can write, in a form 
similar to Eq. (5.31) for the turbopropeller. 


(1 + rs) Vi(s) = a Wils) + a2W2(s) + a3W3(s) (5.42) 


where @1, @2, and a; are real constants. As in the case of the turbo- 
propeller, these constants are slopes of the steady-state performance 
curves of the engine. Thus a; is the rate of change of engine speed 
with respect to tail-pipe opening at constant fuel rates to the engine 
combustion chamber and to the tail pipe. a» is the rate of change of 
engine speed with respect to the engine fuel rate. a3 is the rate of 
change of engine speed with respect to the tail-pipe fuel rate. r in 


Tail Pipe Combustion 


Combustion Chamber 


Compressor Turbine Variable Exit Nozzle 
Fie. 5.7 


Eq. (5.42) is again the only characteristic time of the engine system and 
represents the effects of the inertia of the rotating components. This 
linearized relation between the engine speed and other, engine-input 
variables was derived by M.S. Feder and R. Hood.! 

If the compressor of the engine is an axial compressor, Eq. (5.32) of 
the previous section is again applicable here. Yo2(s) represents the 
turbine-inlet temperature, and thus 


Y2(s) = —eY (s) + cW,(s) 


By solving for Yi(s) and Y2(s) in the above equation and in Eq. (5.42), 
we have 


Viejas 2 a oe ee 


1+ 7s 1+ 7s 1 + rs 
ae (c — ae) + crs 
Y.(s) = ar Wi(s) + cea W2(s) (5.48) 
Q3€ 
[+ rs Wals) 
The elements of the engine matrix are thus 
= Qi = d - a3 
BS Tas ee ak arr 
(5.44) 
E _ ae _ (C= ae) ters p _ 2 
ns oe” es We 3 +r 


1M. S. Feder and R. Hood, NACA TN 2183 (1950). 
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Let us consider the problem of controlling the engine speed, the turbine- 
inlet temperature, and the tail-pipe fuel rate. The controlled variables 
are then Y1(s), Yo(s), and W3(s). The control equations are then 


Uls) = Cir(s)[Xi(s) — Lir(s) Yi(s)] 
+ Cr9(s)[Xo(s) — Lals) ¥2(s)] + Cis(s)[Ea(s) — Lsa(s)W2(s)] 

Vals) = Calal) — Luls)Ya(0)] bia 
+ C22(s)[X2(s) — Lals) Yo(s)] + Co5(s)[Ea(s) — Lss(s)W(s)] 

U3(s) = Csi(s)[X1(s) — Lauls) Yi(s)] 
+ C32(s)[X2(s) — Lals) ¥a(s)] + C4(s)[Es(s) — Lsa(s)Ws(s)] 


where X;(s), Xo(s), and %;(s) are the settings for the engine speed, the 
turbine-inlet temperature, and the tail-pipe fuel rate, respectively. 
The noninteraction condition of Eq. (5.14) requires immediately that 


C'31(8) = C'32(s) = 0 (5.46) 
The condition of Eq. (5.19) gives 


Sir(s)Cx0(s) _ a | 

S22(s)Co20(s) ay 

S22(s)C21(s) = Aye | one 
S1,(s)Ci4(s) 7 (c = a£) + CTS 


The noninteraction condition of Eq. (5.24) gives 


Si1($)C43(8) _ _ %8 
S33(s)C33(8) Qı 
and 
Ch3(s) = 0 (5.48) 


The ratios —d2/a; and —a3/a; in the above equations have simple phys- 
ical meanings: —d2/ai is the rate of change of tail-pipe opening with 
respect to engine fuel rate at constant engine speed and constant tail- 
pipe fuel rate. —da3/a; is the rate of change of tail-pipe opening with 
respect to tail-pipe fuel rate at constant engine speed and constant engine 
fuel rate. 

When Eas. (5.46) to (5.48) are satisfied, we have noninteracting con- 
trol, and the response function for the engine speed is 


oo Sn()Onls) 
SCO E (5.49) 


QC 


Rix(s) = 


The response function for the turbine-inlet temperature is 


So0(s) Cals) 


me = E TE e & (Le) 


(5.50) 
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The response function for the tail-pipe fuel rate is 
hfe) x S33(s)C3,(s) 
Beals) S3a(8)Cz9(8)Laa(s) + 1 6.51) 


These equations then give the starting point of proper design of the 
control transfer functions Cii(s), C22(8), C4,(s), and hence C2(s), Coi(s), 
and C%,(s). 


CHAPTER 6 


ALTERNATING-CURRENT SERVOMECHANISMS 
AND OSCILLATING CONTROL SERVOMECHANISMS 


In this chapter and the two following chapters, we shall extend the 
concepts and methods developed for simple servomechanisms in Chaps. 2 
and 3 to linear systems which are more complicated but nevertheless 
can be treated by approximately the same technique. Therefore they 
demonstrate the power of the basic principles of servomechanism design. 
The contents of this and the next chapter follow closely the treatment of 
L. A. MacColl.t 

6.1 Alternating-current Systems. So far, whenever we have been 
considering a servomechanism containing an electric motor, we have 
assumed implicitly that the motor is a d-c motor. In practice, however, 
it may well be desirable to use a-c motors. It is clear that the use of such 
motors necessitates the reconsideration of some parts of our previous 
discussion. 

Consider a servomechanism as sketched in Fig. 6.1. The purpose 
of the system is to turn the motor to angle ¢, according to the input 
signal. The output angle ¢ is measured by a potentiometer. The volt- 
age across the potentiometer is the feedback signal. In this system all 
of the currents and voltages appearing in the amplifier, motor, and 
potentiometer are modulated sinusoids, t.e., sinusoidal functions of fixed 
frequency, say wo, but with time-varying amplitude. The basic alternat- 
ing current is generated by the oscillator. When a certain condition, 
which will be discussed presently, is satisfied, much of the earlier theory 
is applicable to this system. 

Let us consider for a moment the general steady-state theory of 
linear systems of constant coefficients subjected to signals which are 
modulated sinusoids. Here the expression “steady state” refers to the 
fact that the modulating signals are assumed to be purely sinusoidal 
functions of time. Let the unmodulated “carrier” be cos wt. The 
phase angle is here neglected without loss of generality. Since the 
carrier is expressed in real form, it is obviously legitimate to take the 


1L. A. MacColl, “Fundamental Theory of Servomechanisms,”’ D. Van Nostrand 
Company, Inc., New York, 1945. 
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modulating signal in complex form, e. Then the modulated carrier is 
x(t) = et cos wot = geltet 4 eileen] (6.1) 


and the steady-state response Yzreray(t) of a system having the transfer 
function F(s) is, according to Eq. (2.16), 


Yaaay Ò = Zl (tw + iwert + Fio — two)e ei (6.2) 


For a physical system, the function F(s) is generally a ratio of poly- 
nomials in s with real coefficients. Then, as has already been indicated 


Input 


Oscillator š 


Frc. 6.1 


by Eq. (8.17), 
F(—tw) = F(a) (6.3) 


where the bar over the symbol indicates the complex conjugate value. 
Therefore we can write Eq. (6.2) as 


LF * (iw) eit + F*(—iw)e eet | (6.4) 
where 
F* (iw) = F(tw + iwo) (6.5) 


Now we suppose that the system is such that we have the relation 
F (iwo + iw) = Flw — tw) | (6.6) 
Then the expression of (6.4) can be written in the form o 
F* (tw) e™* cos cool 


This result shows that when the condition of (6.5) is satisfied, the ampli- 
tude of the response of the system to the modulated carrier of Eq. (6.1) 
is the same as the response of a system having the frequency response 
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F*(iw) at the input frequency w. This statement can be immediately 
generalized to apply to more general input functions by the principle 
of superposition for linear systems. If Eq. (6.6) is satisfied, at least 
approximately, throughout a range of values of w which includes the 
more important parts of the Fourier spectrum of a modulating input 
signal z(t), the amplitude of the resulting modulated output signal is at 
least approximately equal to the response of a system having the fre- 
quency response F*(iw) to the input signal z(t). We have shown in 
Chap. 4 that the performance of the feedback servomechanism is deter- 
mined completely by the frequency response. The approximate fre- 
quency response now is F*(iw). Then all the methods for determining 
the performance of a system developed in Chap. 4 can be applied to the 
a-c systems. The only difference is the use of F*(iw) instead of F (iw) 
in the analysis. 

6.2 Translation of the Transfer Function to a Higher Frequency. If 
we leave out of account certain trivial systems, e.g., pure resistances, it 
follows from Eq. (6.3) that 


F (iw + tw) = F(—twy — tw) 


This is different from the condition of Eq. (6.6). Therefore Eq. (6.6) 
cannot be satisfied exactly for all real values of w. Or, if we alter our 
point of view slightly, we can say that the frequency responses F (iw) 
and F*(iw) of two physical systems cannot rigorously satisfy the relation 
of Eq. (6.5) for all real values of w. Nevertheless, it is entirely possible, 
and indeed quite common, for the frequency responses F* (iw) and F(iw) 
of two physical systems to satisfy the relation of Eq. (6.5) approximately 
over a range of values of w which is large enough to include the more 
important parts of the Fourier spectra of the particular input signals 
we are concerned with. We can see this briefly as follows: 

Consider the impedance Z of an inductance L and a capacitance C 
in series, at a frequency w’ 


If we make L and C of such magnitude that 


= 
li 


l 
I0 (6.7) 
then 
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For w — wo = w small, or w + wo near wo, 
Z = 2Lilw — wo) = 2Liw 


That is, at the frequency œw = wo + w, the impedance of the series com- 
bination of L and C satisfying Eq. (6.7) is approximately equal to the 
impedance of an inductance 2L at the frequency w. 

Similarly, consider the impedance Z of an inductance L and a capaci- 
tance C in parallel, at the frequency w 

1 1 ; 

VA = Ta +- Cw 

œ 2Ct(w’ — wo) = 2Ciw 


if condition (6.7) is satisfied. Therefore at the frequency w’ = w + wo, 
the impedance of the parallel combination of L and C satisfying Eq. (6.7) 
is approximately equal to the impedance of a capacitance 2C at the 
frequency w. 

The impedance of a pure resistance is, of course, independent of the 
frequency—it is the same at the frequency w + wo as at the frequency w. 
Thus, starting from a physical system with a transfer function F*(s), we 
can, by replacing any inductance L by a series combination of inductance 
¿L and capacity C = 2/Lw? and by replacing any capacity C by a parallel 
combination of capacity $C’ and inductance L = 2/Cw?, obtain a physical 
system having a transfer function F(s) such that the relationship of Eq. 
(6.6) is satisfied approximately for small values of w. This procedure 
of going from F*(s) to F(s) is called translating the transfer function on 
the frequency scale by wo. 

Let wo denote the frequency of the current supplied by the oscillator. 
It is clear that all of the currents and voltages in the system are modula- 
tions of the carrier wave cos wot. Hence it immediately follows from 
the above that, in order to design the amplifier for the system with 
alternating current, we need only design a suitable amplifier for a system 
with direct current by methods described in Chap. 4 and then translate 
the characteristics of the amplifier upward on the w scale by the amount 
wo, in accordance with the procedure sketched above. 

As we have indicated, the foregoing arguments involve a considerable 
number of approximations of one kind or another. An entirely com- 
plete discussion of servomechanisms with alternating current would 
necessitate an examination of the effects of all these approximations. We 
shall not go into this investigation, because it would be involved and 
tedious, and because it does not appear to be a very urgent matter as 
far as servomechanism art is concerned. 

6.3 Oscillating Control Servomechanisms. We shall now consider 
another class of systems, which we call oscillating control servomecha- 
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nisms. These resemble servomechanisms with a-c motors, in that in both 
cases the signals are caused to modulate a periodic oscillation. How- 
ever, in the case of oscillating control servomechanisms the modulation 
employed is not the ordinary amplitude modulation. In order to intro- 
duce the concept of an oscillating control servomechanism intelligibly, 
we must first give a little preparatory discussion. 

One very primitive but very common kind of servomechanism can 
be described as follows. Suppose that to the system we were to add 
a circuit containing a relay, designed to function so that no voltage would 
be applied to the output terminals unless the absolute value of the input 
x(t) exceeded a certain threshold, and so that when |z| did exceed the 
threshold the output would be the full electromotive force E of a source 
applied with such a polarity as to tend to reduce the absolute value 
of the error. We would then have an example of what we call an on-off 
servomechanism. 

On-off servomechanisms have the great advantage that comparatively 
simple systems of this kind can be made to handle large amounts of 
power. This is often difficult to achieve with servomechanisms of other 
types. On the other hand, on-off servomechanisms are definitely non- 
linear systems, and, as will be shown in Chap. 10, their performances 
tend to be inferior to those of the systems we have considered previously. 
Briefly, an oscillating control servomechanism is a modification of an 
on-off servomechanism, which enables us to secure the advantage of 
linearity without sacrificing the advantage of large power-carrying 
capacity. 

Before proceeding to the treatment of oscillating control servomech- 
anisms proper, we shall present a general theoretical result, upon which 
the theory of all such systems is based. Let us consider a device having 
the following property: According as the input signal z(t) is positive or 
negative, the output signal y(t) is +A or — A, where A is a fixed constant. 
We may think of such a device as an ideal relay, an on-off system with 
zero threshold. Suppose that the input signal to the relay is 


v(t) = Ey sin wot + KE) sin wt (6.8) 


where Ho, k, wo, and w are constants. In connection with oscillating 
control servomechanisms, the term Ey sin wot will be a persistent oscilla- 
tion in the system, and kE sin wt will be an applied signal or modulating 
signal. We shall calculate the corresponding output y(é) presently. 

6.4 Frequency Response of a Relay. The output of the relay in 
response to the input of Eq. (6.8) can be written in the form 


3 
to s 


> Onm SIN [(Mao + nw)E] (6.9) 


O n= 
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where the a’s are independent of t. When m = 0, the inner summation 
is to be extended only over positive values of n. For our purposes the 
only coefficients that are of any immediate interest are diy and ap; 
for in the case of an oscillating control servomechanism operating under 
normal conditions, the other coefficients either are negligibly small, or 
they correspond to oscillations which are suppressed by suitable filtering. 

When k = 0, z.e., when the input to the relay is a purely sinusoidal 
function with frequency wo, the output from the relay is obviously a 
series of alternate positive and negative square waves of height A and 


Output and Input 


duration wo/4r. It is known that the amplitude ao: of the leading term 
of a Fourier expansion of such a square wave is equal to 


Qo = pa (6.10) 

When k #0, the output from the relay is presented in Fig. 6.2. 
The difference of the outputs with k = 0 and with k = 0 is a series of 
rectangles of height 2.4, indicated as shaded areas in Fig. 6.2. When 
|k| <1, the change of switching points of the output from the evenly 
spaced points of tn = nt/w is very small. Thus the correction rec- 
tangles are very narrow, as shown in the figure. The width of these 
rectangles can be calculated as the value of the modulating signal at tn 
divided by the slope of the persistent oscillation at tn. Thus the width is 


kEo sin wt, 


COE | aah 
Ko COS wota Wo 


These rectangular areas are to be added (+) or to be subtracted (—) 
from the unmodulated output according to whether sin wt, is positive or 
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negative. Thus the areas of the rectangles can be considered as 


2Ak 


— sin why 
Wo 


The coefficient aio in Eq. (6.9) is the coefficient of the leading sin wt 
term in the Fourier expansion of this series of narrow rectangular waves. 
Since the value of sin wt at the rectangular areas is sin wta, we have, by 
taking N such correction rectangles, 


N 
Nx/wo k , 
Qio I sin? wt dt = 2A — sin? why 
0 WO 
n=Q 


But 
Nr/wo Nr/eo 
i sin? wt dt = if (1 — cos 2wt) dt 
0 0 
= ee — sin (2N 2) 
2 wo 4 0 
and 
N 


The sum remains finite as we increase N indefinitely. Therefore, by 
making N large, we have 


dio = 2 = (6.11) 
T 


Equations (6.10) and (6.11) give the two important coefficients ao1 
and aw for small k. For general values of k, these coefficients were 
computed by R. M. Kalb and W. R. Bennett. When 0 <k < 1, 


doi = ee E(k) 


(6.12) 
mu = “2 (BQ) — 1 — YK) 


where K(k) and E(k) denote the complete elliptic integrals of the first 
and the second kinds, respectively. For k small, the elliptic integrals 
can be expanded; then 


1 Bell System Tech. J., 14, 322-359 (1935). 
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= k? ) 

ee ee ea 

er (6.18) 
oo = AF 4 E+ ) 


Equation (6.13) shows that our simple computation is correct within 
the accuracy of analysis. However, it also shows that the simple results 
of Eqs. (6.10) and (6.11) are accurate enough for even moderate values 
of k. Therefore the ratio of the component of frequency w in the output 
to the component of the same frequency in the input, t.e., the frequency 
response F,(iw), is approximately equal to 

Fi(tw) = Za (6.14) 
As shown by Has. (6.10) and (6.13), when k is small, the amplitude of 
the component of frequency wg in the output is approximately a constant 
determined entirely by the properties of the relay. Also, the ratio of 
the component of frequency wo in the output to the component of the 
same frequency in the input is 4A4/(rE)). Thus the amplification of 
the relay for the component of frequency wo is 6 db greater than the 
amplification for the component of frequency w. 

Now it is easily seen that the preceding considerations can be extended 
to the case in which we have, instead of the signal ££) sin wt, a signal 
z(t) of arbitrary form whose magnitude is much smaller than Fy, the 
amplitude of the persistent oscillation. The essential result can be stated 
as follows: If we ignore higher-order modulation terms for the reason 
that they are negligibly small, or that they are to be suppressed ulti- 
mately by suitable filtering, the relay with the input 


E, sin Wot + x(t) 


where z(t) is small compared with Eo, behaves, as far as the transmission 
of the signal z(t) is concerned, substantially as a linear system having a 
constant frequency response given by Eq. (6.14). 

6.5 Oscillating Control Servomechanisms with Built-in Oscillation: 
We are now ready to discuss the oscillating control servomechanism. 
We have already seen that the only effect of the persistent oscillation, as 
far as the transmission of signals is concerned, is to make the relay 
into an effectively linear element, with a positive real frequency response. 
Hence we might have considered the relay to be such an element from 
the beginning, avoided all explicit mention of the oscillation Eo sin wot, 
and dealt with the system entirely by means of the concepts and methods 
given in the earlier chapters. This is, in fact, the novel and excellent 
procedure proposed by J. C. Lozier. 
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For the sake of simplicity, we have assumed so far that the inherent 
properties of the servo afford all of the filtering that is necessary to sup- 
press the unwanted modulation terms introduced by the relay. It is at 
least conceivable that, in practice, it may sometimes be necessary to 
supply additional filtering by means of supplementary filters. Natu- 
rally, whatever effective filters there may be in the system must be 
such that they pass the wanted signals. This, in combination with our 
other considerations, implies that the frequency wo must be above the 
important parts of the Fourier spectra of the signals. 

No matter what filtering we may introduce into the system, the out- 
put will always contain, as one component, an oscillation having the 
frequency wo. It is worth remarking that it may not even be desirable 
to reduce the amplitude of this component below a certain level, by 


Feedback 
Circuit 


Fra. 6.3 


filtering. In fact, such an oscillation furnishes “dynamical lubrica- 
tion,” which diminishes the effects of static friction, backlash, and 
other parasitic nonlinearities tending to degrade the performance of 
servomechanisms. 

We have not said anything very specific about the way in which 
the oscillation Æo sin wot is supplied to the relay; we have merely remarked 
incidentally that it may be supplied by a subsidiary oscillator. Systems 
in which the oscillation is supplied in that way have certain advantages 
in the way of flexibility. However, they have the disadvantage of 
involving a certain amount of extra equipment. We shall now give 
a brief description of a variety of oscillating control servomechanisms in 
which the servomechanism itself is made to supply the oscillation. 

Consider the system sketched in Fig. 6.3. Suppose that the system 
is so designed that in the absence of the input signal it oscillates at a 
frequency wo determined by the phase shifts of the linear elements in the 
feedback loop. As we have seen, the relay behaves, as regards the 
oscillation, as an effectively linear element, having a frequency response 
which decreases as the amplitude is increased. The amplitude of the 
oscillation adjusts itself so that the amplification around the loop, 
determined by the amplifications through the relay and through the 
linear elements, is unity. 
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Now suppose that the system is subjected to an input signal. If the 
corresponding error signal at the input of the relay is sufficiently small, 
the amplification of the relay for the persistent oscillation is substantially 
unaffected, and the system continues to oscillate at substantially the 
original frequency and amplitude. As we have seen, the relay behaves, 
as regards the signals, as an effectively linear element, having an ampli- 
fication which is 6 db less than the amplification for the persistent oscilla- 
tion. It is clear that under these conditions we have an oscillating- 
control mechanism such as we have discussed above. The only novelty 
in the present situation is the fact that the frequency and amplitude 
of the persistent oscillation Eo sin wot are determined by the system 
itself, instead of being determined independently, as we have tacitly 
assumed heretofore. 

In all our considerations of the system as a servomechanism, we need 
only ascribe the proper effective frequency response to the relay, and 
then proceed in the ways described in the preceding chapters. We do 
not need to take account explicitly of the persistent oscillation. How- 
ever, the requirement that the system shall also function as an oscillator 
imposes certain restrictions on what we can do toward improving its 
performance as a servomechanism. ‘This can be seen as follows. 

Let F(s) denote the transfer function of the feedback loop for the 
signals computed by using Eq. (6.14). Then the transfer function for 
the persistent oscillation is 2F'(s); and, by the very fact that the system 
does oscillate, there is a purely imaginary root s = tw, for the system 
transfer function 1 + [1/2F(s)|. Therefore 


2F (iwo) = —] 


Hence, the curve 1/F (s) in the Nyquist diagram, as s traces the imagi- 
nary axis, is constrained to pass through the point —2. On the other 
hand, in order that the performance of the system as a servomechanism 
shall be satisfactory, the curve must meet the conditions we have dis- 
cussed in Chap. 4, including the condition of avoiding the neighborhood 
of the point —1. Obviously, the constraint to which the curve is sub- 
jected makes it more difficult to meet these conditions than it is in the 
other systems, where no such constraint exists. In this sense, these self- 
oscillating servomechanisms are less flexible than are oscillating control 
servomechanisms in which the oscillation is supplied by an independent 
generator. 

An elementary precaution to be observed, in order that the curve, 
which is constrained to pass through the point —2, shall avoid the 
neighborhood of the point —1, is that the curve should intersect the 
real axis at the point —2 perpendicularly. This implies that the vector 
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1/F (iw) should be varying slowly in magnitude and rapidly in angle at the 
frequency at which the system oscillates. 

6.6 General Oscillating Control Servomechanism. A relay is a non- 
linear device. But by mixing the signal with a sinusoidal oscillation of 
high frequency and large amplitude, the output is made to be linear with 
respect to the signal. Thus the essential concept of oscillating control 
servomechanisms is the linearization of a nonlinear system. J. M. Loeb! 
has shown that this concept is applicable to any nonlinear system, and 
he calls this method the general linearizing process for nonlinear control 
systems. We shall call the resulting servomechanism the general oscillat- 
ing control servomechanism. 

Let us consider a general function y(x), where y is the output and x 
is the input. If for the variable x we substitute the sum x + e, where e 
is much smaller than zx, then, if the function y(x) is regular, we can expand 
y(x + e) into a Taylor series as 


ylz + ș) = TER- ay +é 5 (24) ER (6.15) 


We now specify the input x as a periodic function of time ¢ with the 
period T, and «as a constant. Then itis clear that y(x) is also a periodic 
function of time with the same period T. The same is true for dy/dx and 
d?y/dx?. Periodic functions can be expanded into Fourier series; thus 
if we neglect powers of e higher than the first, we have 


[na] 


y(x + e) = Goo + y (don COS nwt + bon sin not) 


3 
It 
ro 


+ e| an + | (Qin COS nwt + by, sin nat) | (6.16) 
n=] 


where w = 2r/T, the frequency of the input z. 

If e is not exactly a constant but a slowly varying function of t such 
that its fundamental frequency is very much lower than w, then Eq. 
(6.16) is still approximately correct. Now consider y(x) as the output- 
input relation of the nonlinear device, e(é) as the signal, and x(f) as the 
superimposed high-frequency, large-amplitude oscillation, not necessarily 
sinusoidal. The signal information in the output of the nonlinear device 
is represented by the second term of Eq. (6.16). Since the frequency w 
is much higher than that of e(t), the periodic function represented by the 
Fourier series 


+} 


Gig + > (Qin COS nwt + bin sin nwt) 


n=] 


! J. M. Loeb, Ann. de Télécommunications, 5, 65-71 (1950). 
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can be considered as the carrier, while e(t) is the modulating function. 
While the above discussion is based upon a direct relation y(x) between 
the input and output of the nonlinear device, Loeb has shown that Eq. 
(6.16) is also true for the more general functional relation between y and z, 
that is, y at t is dependent not only upon the instantaneous z at the same 
t, but also upon all past values of z. This extended concept of input- 
output relation includes hysteresis effects such as gear blacklash, and 
is sufficient for almost all nonlinear devices in practice. Therefore, for 
a general oscillating-control servomechanism, the signal in the output 
of the nonlinear device appears as a modulated carrier wave. Moreover, 
the input-output relation is linear as far as the signal is concerned. 


Fic. 6.4 


Now let us assume that the superimposed oscillation has a symmetrical 
wave form such as the sine wave or the saw-tooth wave shown in Fig. 6.4. 
Then if y(x) is even, or 

y(x) = y(—2) 


dy\ __ (W (6.17) 
od a 


we have the following relations for the periodic function y(x) and dy/dz 


y(t): = yle) r | 


| | (6.18) 
=o @-@ 
dx t dz t> 
These requirements then specify 
dor = bo = Q, dio = 0 fory even (6.19) 


Therefore, if the higher harmonies are neglected, the carrier is a sinusoidal 
oscillation of frequency w. This is the case of the a-c servomechanism 
discussed in the previous sections. The design method described there 
can then be applied to this class of general oscillating control servo- 
mechanisms. If y(x) is odd, or y(x) = —y(—2z), then a set of conditions 
similar to Eqs. (6.17) and (6.18) can be written down, and 


Qo = 0, ay = by = 0 for Y odd (6.20) 
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By neglecting higher harmonics, the case is thus identical to the oscillat- 
ing control servomechanism discussed in Sec. 6.4. 

The preceding discussion shows that the characteristics of a nonlinear 
component in a servomechanism can be linearized by the technique 
of adding a persistent oscillation to the input and thus converting the 
system into an oscillating control servomechanism of improved per- 
formance. Furthermore, such servomechanisms can be designed with 
methods already explained in this chapter. 


CHAPTER 7 


SAMPLING SERVOMECHANISMS 


All the servomechanisms that we have considered so far are designed 
to deal with signals which are given as functions of the continuous 
variable ¢ There are situations, however, in which the signals which a 
servomechanism has to deal with are given as functions of a discrete 
variable. Such a situation arises, for example, when we have an input 
signal which has been obtained by determining the values of a function 
z(t) at equally spaced instants 0, to, 2to, . . . . In such a case, the input 
signal is not defined at all in the open intervals between the successive 
sampling instants. 

Naturally, when we have a situation of the kind just described, we 
are interested in the values of the output signal at the sampling instants. 
Consequently, the servomechanism should function so that the correc- 
tive effect which it applies to the output signal is governed only by 
those values and not by the values which the output signal may have 
during the intervening intervals. A servomechanism which is designed 
to function in this way may be called a sampling servomechanism. In 
this chapter, we give a brief account of a theory of linear sampling servo- 
mechanisms which is very similar in its point of view and procedure 
to the theory of continuously operating servomechanisms that we have 
been discussing in the preceding chapters. 

7.1 Output of a Sampling Circuit. The prototype sampling servo- 
mechanism which we shall consider is shown in Fig. 7.1. The system 
contains the usual forward and feedback circuits. The essential novelty 
of the system lies in the fact that the feedback path contains a switch, 
which is operated periodically so that the feedback loop is closed only 
during short time intervals located at the equally spaced instants 0, to, 
2t, .... The location of the energy-storing, or frequency-selective, 
elements in the system affects the theory in matters of detail only. 
Hence we take the opportunity to simplify the exposition somewhat by 
assuming that the transfer function of the forward circuit is independent 
of the frequency. Then it is essential that the switch be placed in the 
position shown. 

The following analysis is based upon the assumption that the intervals 
during which the switch is closed are so short that the feedback circuit 
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can be considered to be subjected to a sequence of impulses. It is also 
based upon the assumption that the response h(t) of the feedback circuit 
to an impulse is a continuous function of time. This means that for 
small values of ż, ho(t) behaves like t”, where n > 1. Then h(t) has no 
jump in value at ¢ = 0. If F(s) is the transfer function of the feedback 
circuit, then according to the general formula, Eq. (2.18), 


y+ti© 
haf) = = | _, Fae" ds (7.1) 


where y is a real constant which is greater than the real part of any pole 
of F(s). If for large values of s, F(s) behaves like 1/s”, then according 


y(t) 


Periodically 
Operated Switch 


Feedback Circuit 


Fig. 7.1 


to the “dictionary” of Laplace transforms, Table 2.1, he(t) will behave 
like 7-1! for small t. Our condition for the continuity of he(t) at t = 0 
thus requires m to be at least 2. Thus for large s, F(s) approaches zero 
at least as rapidly as 1/s?. 

Now it is clear that if the input signal z(t) vanishes identically for 
negative values of t, the value of the output signal y(t) at the typical 
sampling instant nto is computed as the sum of the effects of all previous 
impulses and is given by the formula 


y(nt) = F; | (nt) — big du (Ietéo)he(nty — kto) | (7.2) 


where F, denotes the transfer function of the forward circuit, a constant, 
and @ is the fraction of the switching cycle during which the switch is 
closed. Ot y(kéo) is thus the “impulse” input to the feedback circuit at 
t = kto. 

When z(t) and h(t) are known at the sampling instants, the values 
of y(0), y(to), yio), . . . can be calculated successively by Eq. (7.2) 
in an elementary way. However, instead of proceeding in that way, we 
shall follow a more illuminating course, which will bring the theory of 
the sampling servomechanism into a form similar to that of the theory 
of the ordinary servomechanism discussed in Chap. 4. This approach 
is due to G. R. Stibitz and C. E. Shannon. 
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7.2 Stibitz-Shannon Theory. Let us write 


(ea) 


X*(s) = x (nto) e778 (7.3) 
2. 
Y*(s) = Y ylnte)em* (7.4) 
and D 
F3(s) = to ) hante" (7.5) 
2, 


These functions are thus periodic functions of s, with the imaginary 
period 2r7/t. The functions of nto are thus the Fourier coefficients. 
The step of going from the functions z(t), y(é), and haft) to X*(s), Y*(s), 
and F¥(s) is very similar to the formation of the corresponding Laplace 
transforms X(s), Y(s), and F(s), as indicated by Eq. (2.1). Here the 
continuous time variable is replaced by the discrete time instants nto, 
and thus the integral sign is replaced by the summation sign. Therefore 
Eqs. (7.3) to (7.5) represent the natural adaptation of the Laplace- 
transform technique to the problem of the sampling servomechanism. 

For the time being we shall confine our attention to the case in which 
all of the poles of Fə(s) lie to the left of the imaginary axis. Then the 
function As(t) ultimately decays exponentially as ¢ tends toward infinity, 
and the series in Eq. (7.5) converges for all values of s with real parts 
which are greater than a certain negative constant. Of course, the con- 
vergence of the series in Eqs. (7.3) and (7.4) depends upon the nature 
of the input signal. We restrict our attention to input signals for which 
the series converge in the same manner as the series in Eq. (7.5). This 
amounts only to the mild sort of restriction on z(t) that we are accustomed 
to assume in transient theory. 

Multiplying Eq. (7.2) through by e and then summing over all 
values of n, we obtain 


a 


Y*(s) = F, | X*(s) — By ` enntus y y(ktp)ho(nty — lt) | 


n=0 k=0 
But 
Ato X gortos ` y(kto)hs(nto — kto) 
n=0 k=0 
Ga n 
= bto D > y (kto jetese- Dtsha(nto — kto) 
n=0 k=0 
= Gly ) y y(ktoje tse meh (mto) 
m=0 k=0 


OF *(s) ¥*(s) 


I 
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The step of changing the summation over n and k to a summation over k 

and m = n — k is indicated in Fig. 7.2, where the shaded region is the 

region of summation. Therefore 
Y*(s) = Fi[X*(s) — OF 3(s) ¥*(s)] 


—— > (n—-k)=m 


or 
Y*(s) F, 
Fo area 9 
$ Equation (7.6) is the analogue of the 


Fre. 7.2 basic equation (4.3) for feedback servo- 


mechanisms, discussed before. What difference there is between the cases 
lies in the analytical natures of the functions involved. We shall discuss 
this point later. 

Now let us assume that the character of y(nto) is such that the series 
of Y*(s) is convergent for values of s with nonnegative real parts. Then 
the series is convergent for purely imaginary s. Lets = tw. Then 


y* (iw) einige = > y (mt 0) giw (m~n) 
m= 
Therefore 
wot (2r/to) : l Oar 
i Y*(iw)e dw = y(nto) 
0 


GQ 


or 


to wot (27/to) 
y(nto) = 4 l Y* (tu) eos dey 


0 


By putting ww = s and iwo = So, we have, finally, 


ty so+ (2ri/to) 
ynt) = = [ Y*(s)e” ds 


0 


By the Cauchy theorem of complex integration, 


to [ FiX*(s)ente 
= * ntos eo coined mneritenatiosirenreemrmaerermtenrtnintnanenewvase, 
y (nto) z l Y*(s)em* ds ori fe VF OPES) ds (7.7) 


where, as shown in Fig. 7.3, T is a path of integration joining two points 
separated by the distance 2r/to on the imaginary axis in the s plane, and 
passing to the right of all singular points of the integrand. This descrip- 
tion of T is now in such a general form that if Y*(s) has poles with posi- 
tive real parts, Eq. (7.7) is still true. 

Because of the periodicity of X*(s) and F¥(s), we can add to T the 
dotted lines parallel to the real axis and leave the value of the integral 
unchanged. The combined path then encloses all the poles of the 
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integrand. But it can be shown that for reasonable input X*(s) has 
no poles with positive real parts. Then the only possible source of 
unstable output is a zero of the denominator of the integrand of Eq. (7.7) 
with a positive real part. Thus, in a way very similar to the require- 
ment on conventional servomecha- 


nisms, the necessary and sufficient oni s - plane 
condition for stability is that the ~*~~~77777 sot 
equation 
1 + 6FiF3(s) = 0 (7.8) 
shall have no roots in the right half of A 


the s plane. We shall now show how 
we can implement this condition by 
an appropriate adaptation of the tie. 72 
Nyquist criterion of Sec. 4.3. a 

7.3 Nyquist Criterion for Sampling Servomechanisms. Because of 
the periodicity of F¥(s), it suffices to determine whether or not Eq. (7.8) 
has any roots in a horizontal half strip, of width 2r/to extending to the 
right from the imaginary axis. We are assuming that Fž(s) has no 
singular points on or to the right of the imaginary axis, and we also assume 
that 1 + 6FF*(s) has no zero on the imaginary axis. We can, and do, 
assume that the half strip is adjusted vertically so that 1 + 6F iF f(s) 
has no zeros on the horizontal sides. Now let the point s describe the 


B 


C 
Fra. 7.4 


closed curve ABCDA of Fig. 7.4. Then the corresponding tip of the 
vector 6FiF#(s) describes a certain closed curve, such as A’B’C’D’A’ 
shown in Fig. 7.5. We do not try to show the curve described by the 
vector realistically. When s describes AB, we have the are A’B’. 
When s describes BC, we have an arc B’C’; and, because of the periodicity 
of F*(s), B’C’ is a closed curve. When s describes CD, we have an arc 
C’D’; and, because of the periodicity of F}(s), C’D’ coincides, except for 
sense, with A’B’. Finally, when s describes DA, we have the are D'A’, 
which is a closed curve. 
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By Cauchy’s theorem, Eq. (7.8) does, or does not, have roots in the 
rectangle ABCD according to whether the radius vector from the point 
—] to the running point 0F F$ (s) does, or does not, make a nonzero 
net number of revolutions as the running point describes the curve 
A’B’C’D' A’, Now consider what happens when the side BC of the 
rectangle in Fig. 7.4 recedes to infinity. It is easy to see from Eq. (7.5) 
that the closed curve formed by the are B’C’ in Fig. 7.5 shrinks to a single 

i point. Therefore the effective curve 

OF F2(s)-plane is the are D'A’. Obviously, Eq. 

(7.8) does, or does not, have roots in 
the half strip according to whether 
the radius vector from the point —1 
to the curve does, or does not, make 
a nonzero net number of revolutions 
as the limiting curve is described. 
This is the Nyquist criterion for 
simple sampling servomechanisms. 

We have now given the fundamen- 
tals of the Stibitz-Shannon theory 
of sampling servomechanisms, which is very similar, both in its point 
of view and in its form, to the theory of servomechanisms with continuous 
operation. 

74 Steady-state Error. If the input is a unit step function, 


Fig. 7.5 


%W (e a] 


X*(s) = X e zZ D (eten = —_ 


n=0 n=0 


then, according to Eq. (7.7), 


(nto) = 2 jl m a 
Ww Ont Je eF FO] 


As n— œ, the only pole of importance is at the origin, 


' F 
ET AEO A 


This equation gives the “steady-state” output for a constant input of 
unit magnitude. Therefore the condition for small steady-state error is 


FP, = 1+ 0FF3(0) 
or 
] 


Pi © TERR) 


(7.10) 
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This gives the approximate magnitude of the gain for the forward circuit 
if the output is to follow the input accurately. Equation (7.10) for 
sampling servomechanisms is the analogue of Eq. (4.11) for continuous 
servomechanisms. 

7.5 Calculation of F¥(s). We have one more step to take before our 
theory of sampling servomechanisms can be regarded as being of much 
practical value. Both of the functions F(s) and F3(s) serve to character- 
ize the feedback circuit: F(s) is the significant characteristic when the 
circuit is used as part of a continuously operating servomechanism; and 
F¥(s) is the significant characteristic when the circuit is used as part of a 
sampling servomechanism. The familiar function F.(s) is mathe- 
matically much the simpler of the two, and furthermore it is the function 
which is used directly in our common techniques for designing circuits. 
It is important, therefore, that we relate F$(s) to F(s), and in as direct 
a manner as possible. We note again that Fy(s) is periodic in s with 
the imaginary period 727/t). Furthermore, the analysis of the perform- 
ance of the system by the Nyquist method requires the “frequency 
response” F*(zw) only for —r/ti < w < r/to. 

Assuming that the real part of s is greater than y, which according 
to our assumptions is a negative number, we have, by Eqs. (7.1) and 
(7.5), 


to Ytin 


— F(q) dq , grisa) = 


to o Fa(4) dg 


2ri 1 = emed 


(7.11) 


—~%4~ 
n=0 


We proceed to evaluate the right-hand member of (7.11) by the method 
of residues. 

The integrand has certain poles: the poles of F2(s) lie to the left of the 
path of integration, and the poles which are the roots of the equation 
1 — e-a = 0 lie to the right of the path of integration. It is easily 
seen that the integration upward along the line y — tœ to y +12 is 
equivalent to integration in a clockwise direction along the closed curve 
formed by that line and the infinite semicircle in the right-half plane. 
Hence the right-hand member of Eq. (7.11) is —é times the sum of the 
residues of the integrand with respect to the several roots of the equation 
1 — eed = 0. 

Now the typical root of the equation is g = s + (Qrim/t.), where m 
is an integer and the residue of the integrand with respect to that pole 
is —(1/to)F2[s + (2rim/to)]. Therefore, finally, 
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= 6 


F¥(s) = >, F, (s+ 2) (7.12) 


= 


This formula gives considerable insight into the properties of F}(s) and 
at times may be useful in making approximate calculations. However, 
we can easily obtain an exact representation of F}(s) in finite form. 

The function F2(s) can be represented as the sum of a finite number of 
partial fractions, thus: 


F,(s) = 2 va (7.13) 


S$ — & 
k=1 


where the a,’s and the s,’s are constants and n is the degree of the denom- 
inator polynomial of F(s). Consequently, we can write, by using Eq. 
(7.12), 


= t 2(8 — Sz) 
p » ý f — $k j », amo (7.14) 


k=] m=) 


Now it is known that coth z has the following expansion: 


[<9] 


l 1 
cothe=+ +2) a 


m=] 


Therefore the sum over m in Eq. (7.14) can be carried out, and we have 


F*(s) = > >, a, coth ee (7.15) 


k=] 


By means of this formula we can compute F3(s) exactly for any value 
of s. . 
When t is very small and F¿(iw) is negligibly small outside of the 
interval —r/to < w </to, the qualitative nature of F*(iw) is immedi- 
ately apparent from Eq. (7.12). In fact, F* (iw) is approximately equal, 
in the interval —r/to < w < /to, to the function Fz(iw). We shall now 
see that when to is large we can obtain an equally simple approximation 
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to Fo(iw). Let us write the roots s, as 
Se = — À; + Tk (7.16) 


where the \’s and w’s are all real. In accordance with our assumption, 
the d’s are all positive. Now we have, according to Eq. (7.15), 


n 


F¥(iw) = z ` a, coth i As + ilw — w] 


k=1 
n 
to 1 + eto Axti(w—o3)] 
= 5 Ak J — en tole ti(e—ax)] ` 
k=1 


Therefore, for large values of to, 


n 


F¥ (tw) = a ` ar {1 + 2e Pitio] } (7.17) 


k=] 


When s is large, Eq. (7.13) can be written as 


1 ] 
Fa(8) -1X ath) an+ soso 
k=1 k=1 


But we have assumed as a condition for continuous response of the feed- 
back circuit to an impulse that Fa(s) ~ 1/s? when s is large. Therefore 


a = 0 (7.18) 
Then Eq. (7.17) becomes 
F¥ (ia) = tge > yet (7.19) 
k=1 


For physical systems, the s;’s are real or form pairs of complex conjugates. 
Therefore the finite sum in Eq. (7.19) is actually real, and the graph 
of F% (tw) as w goes from —2/to to t/t is a circle with the radius 


n 
Sa 
k=1 


7.6 Comparison of Continuously Operating with Sampling Servo- 
mechanisms. For small tọ we have seen that F}(tw) is approximately 
Fe(iw). The stability criterion for the continuously operating servo- 
mechanism is that the curve F\F2(tw) should avoid the point —1. For 


(7.20) 
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the sampling servomechanism, the curve 6F,F3(iw) should avoid the 
point — 1, or the curve F\F (iw) should avoid the point 1/0. Therefore, 
if stability is the only consideration, the sampling servomechanism can 
have much larger gain than the conventional servomechanism. 

For large values of to, because of Eqs. (7.19) and (7.20), the Nyquist 
criterion becomes simply 

n 
to » Qet osk 


k=1 


OF: <1 


Since s, has a negative real part, the radius of the Fž(îw) curve is very 
small. This fact together with the smallness of 0, the fraction of time 
when the switch is closed, allows very large gain for the forward circuit 
without instability. Thus for any value of the switching period to, the 
condition for stable operation of a sampling servomechanism is very 
much less stringent than that for a continuously operating feedback 
servomechanism. Perhaps this is to be expected, because the time 
interval when feedback of the output actually occurs is very brief, and 
no restriction is imposed on the output other than that at the switching 
instant. 

7.7 Pole of F.(s) at Origin. In practice, the function F(s) is quite 
likely to have a pole at s = 0. So far we have excluded this case from 
consideration, in order to avoid having to deal with certain minor com- 
plications. However, we shall now consider the case briefly. 

In the first place, we observe that when s = 0 is a pole of F.(s) the 
constant y must be positive and that our representation of F¥(s) in 
terms of infinite series is valid only for values of s with positive real 
parts. In the second place, the Nyquist diagram for the system also 
undergoes changes. Specifically, instead of getting an actually closed 
curve, we get an open curve, the ends of which are to be regarded as 
being joined by an infinite semicircle in the clockwise sense. Our 
representation of Fž(s) in finite form, however, remains valid. If we 
set sı = 0, then Eq. (7.15) gives 


Fi(ia) = | ~ias ot $+ Yo coth e+ He = a] 
k=? 


For żo large, we obtain an expression similar to Eq. (7.17), i.e., 


F3(iw) = h| io cot = F ` a {1 + destino 
k=2 


But according to Eq. (7.18), 
aa t as + eoe +a, = —ay 
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so that 


Fi(ia) = — = £ + i cot | + toe ` Ape oE (7.21) 


k=2 


The constant a1 is, of course, real and positive. As w varies from —r/to 
to +7/to, the first term gives a vertical straight line parallel to the 
imaginary axis. The other part of Fž(iœ) is a sinusoidal function. 
Hence the Nyquist diagram is a sinuous variation of a straight line. 


CHAPTER 8 


LINEAR SYSTEMS WITH TIME LAG 


Tn this chapter we shall introduce another new element into our linear 
systems with constant coefficients: the time lag. By time lag r we mean 
that the relation between the different variables of the system cannot 
be expressed as a relation of these variables all taken at some time instant 
i; but on the contrary, the relation involves some variables taken at the 
time instant t, and some taken at an earlier instant’ — 7. Those taken 
at the instant ¢ — r then lag by the interval r behind the variables taken 
at the instant t. This time lag is thus quite different from the character- 
istic time constant of the first-order linear system introduced in Sec. 3.1. 
Time-lag systems are represented by differential difference equations of 
constant coefficients and are more complex than the linear systems 
studied previously, which are represented by differential equations. 
Systems with time lag were studied by many investigators: for instance, 
A. Callander, D. Hartree, and A. Porter,! and N. Minorsky.? Our inter- 
est here is, however, somewhat more restricted. We wish to know: 
How can we analyze the performance of a feedback servomechanism 
if there is a characteristic time lag 7 in the system? We wish, specifically, 
to modify the Nyquist method of Sec. 4.3 to apply to time-lag systems. 

We shall develop the theory by treating a particular example of such 
systems, namely, the example of stabilizing the combustion in a rocket 
motor by feedback control. The problem of combustion instability in 
rocket motors has been treated by many authors, but the following 
analysis of combustion lag time originates from the work of L. Crocco.’ 
For simplicity of calculation,* we shall consider only the case of so-called 
low-frequency oscillation in a rocket motor using a single liquid propellant. 

8.1 Time Lag in Combustion. Let m,(t) be the mass rate of genera- 
tion of hot gas by combustion at the time instant t. The mass rate of 
injection at ¢ can be denoted by m(t). Let z(t) be the time lag for that 


1 A. Callander, D. Hartree, and A. Porter, Trans. Roy. Soc. London (A), 235, 415-444 
(1935). 
2 N. Minorsky, J. Appl. Mechanics (ASME), 9, 67-71 (1942). 
3 L. Crocco, J. Am. Rocket Soc. 21, 163-178 (1951). 
t The following discussion is based upon a paper in J. Am. Rocket Soc., 22, 256-262 
(1952). 
94 


LINEAR SYSTEMS WITH TIME LAG 95 


parcel of propellant which is burned at the instant t Then the mass 
burned during the interval from ¢ to t + dt must be equal to the mass 
injected during the time £ — r to t — r + d(t — 7). Therefore 


malt) dt = mult — r)d(t — 7) (8.1) 


The mass of hot gas generated is either used to fill the combustion 
chamber by raising its pressure p(t) or is discharged through the rocket 
nozzle. If the frequency of the possible oscillations within the chamber 
is low, then the pressure in the chamber can be considered uniform, 
and, as a first approximation,’ the flow through the nozzle can be con- 
sidered quasi-stationary. Therefore the mass rate of discharge through 
the nozzle is proportional to the density of hot gas in the rocket motor. 
But for a “‘monopropellant” rocket motor, the temperature of the hot 
gas is nearly independent of the combustion pressure, and the density 
of the hot gas is proportional to pressure only. Thus if m is the steady 
mass rate of flow through the system, M, is the average mass of hot gas 
in the motor, p is the steady-state pressure in the combustion chamber, 
and if the volume occupied by the unburned liquid propellant is neglected, 


we have 
tin dt = Th (2) dt +d (1, 4 (8.2) 


We now introduce the nondimensional variables » and u for the 
chamber pressure and the rate of injection, respectively, defined as 
p— P _ ms — ™ 


202 8.3 
9 3 H = (8.3) 


y and u are then the fractional deviations of the pressure and injection 
rates from the average. With Eq. (8.3), 7 can be eliminated from 
Kas. (8.1) and (8.2), and 


Mee peta (1-H) ue—n4u (8.4) 


To calculate the quantity dr/dt, Crocco’s concept of the pressure 
dependence of time lag has to be introduced. If the rate at which the 
liquid propellant is prepared for the final rapid transformation into hot 
gas is a function f(p), then the lag r is determined by 


Í, S f(p) dt = const. (8.5) 


This constant can be thought of as the amount of heat that has to be 
added to a unit mass of the cold injected propellant before “ignition” 


1 H., S. Tsien, J. Am. Rocket Soc., 22, 139-143 (1952). 
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occurs. Then f(p) has the physical meaning of the rate of heat transfer 
from the hot combustion gas to the injected liquid propellant. By differ- 
entiating Eq. (8.5) with respect to ¢, 


Hk = ole ( : r) -0 


We can now introduce explicitly the concept of a small perturbation from 
the uniform steady state. Assume that the deviation of the pressure p 
from the steady-state value # is small. Then f(p) at the instant t and 
f(p) at the instant ¢ — 7 can be expanded as Taylor’s series around ĝ. 
By taking only the first-order terms, 


(mh = KO) + p (2) oli) 
ol = 10) +7 p o(t — 7) 


Here 7 is the lag at the average pressure p, a constant now. Then 


dr d log f _ B 
1-H -14 (GEL) eO- et- 868 
By combining Eqs. (8.4) and (8.6), the following equation is obtained: 
d 
£ + = ule — 8) +nlvl2) — ole - 8) (8.7) 
where 
_ (dlog fs ) 
= k log p/p=3 ee) 
and ` 
_ Ms zt mE 
0, = ai. = ð, ô = 6, (8.9) 


6, is thus the average gaseous mass in the motor divided by the average 
rate of mass flow through the motor, and is thus the average time between 
the instant of production by combustion of the hot gas to the instant 
of discharge through the nozzle. 9, is therefore called the gas transit 
time. We shall measure time by this fundamental time constant in 
the following calculations. zis the nondimensional time variable, and 
ô is the nondimensional constant time lag of combustion. 

If n is a constant independent of #, then f(p) is proportional to p”. 
This is the form of f(p) assumed by Crocco. The present formulation 
of the problem is slightly more general, in that f(p) is arbitrary and the 
value of n is to be computed by using Eq. (8.8) and is a function of @. 
If f(p) is considered as the rate of heat transfer from the hot combustion 
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gas to the propellant droplet, then the physical laws of heat transfer 
indicate a value for n between 5 and 1. 

8.2 Satche Diagram. Crocco called the instability of combustion 
with a constant rate of injection the intrinsic instability. If the injection 
rate is a constant not influenced by the chamber pressure p, then u = 0. 
Therefore the stability problem is controlled by the following simple 
equation, obtained from Eq. (8.7): 

de 
z T (L nel) + nee — ò) = 0 (8.10) 
Equation (8.10) could be treated by the Laplace-transform technique in 
the same way as equations without time lag discussed in the previous 
chapters. In fact this is the method used by H. I. Ansoff.! However, 
for the present problem of stability of combustion, the fundamental 
equation has no forcing term. Thus a more direct approach is that of the 
classical method of solving linear differential difference equations. That 
is, let 
plz) ~ e” 

then 

s+ (1 — n) + ne =0 (8.11) 


This is the equation for the exponent s. Stability of combustion then 
requires that the real part of s be negative. 

Equation (8.11) can also be obtained by applying the Laplace-trans- 
form method to Eq. (8.10). By multiplying Eq. (8.10) by e-* and then 
integrating with respect to z from z = 0 toz = œ, we have, noting that 
(s) is the Laplace transform of (z), 


sB(s) — o(0) + (1 — n)®(s) + |” o@ — de de = 0 
But 
i i olz — je dz = e-* Í . ple — de?) dz 
= gs) | ®(s) + ie olee” de! | 


Therefore, if the initial conditions are such that » = 0 for z < 0 for the 
so-called null initial conditions, then 


[s+ Q — n) + ne]8(s) = 0 


Hence we have Eq. (8.11). s has the same “‘meaning”’ as the variable s 
in the previous chapters. The only difference between the two is the 
fact that here s is made nondimensional by the transit time @,. It is 
also interesting to note that if Eq. (8.10) were a nonhomogeneous equa- 


1H. I. Ansoff, J. Appl. Mechanics (ASME), 16, 158-164 (1949). 
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tion with a forcing term at the right side, then after applying the Laplace 
transform to the equation, the resultant equation would also be nonhomo- 
geneous. ‘Then it will be seen that the function 


1 
~ s+ (—n) +ne* 


is the transfer function of the system, with (z2) considered as an output. 

F(s) is thus another example of a transcendental transfer function. 
Crocco determined the value of the complex root s by solving the set 

of two equations for the real and the imaginary parts of Eq. (8.11). 


F(s) 


Frc. 8.1 


However, if the point of interest is whether the system is stable or not, 
one can again use Cauchy’s theorem of Sec. 4.3 with advantage. Let 


l—-n s 
G(s) = e — (= A >) (8.12) 
Then the question of stability is determined by whether G(s) has zeros 
in the right half of the complex s plane. This question itself can be 
answered in turn by watching the argument of G(s) when s traces a 
curve enclosing the right-half s plane, as shown in Fig. 4.4. If the 
vector G(s) makes a number of complete clockwise revolutions, then, 
according to Cauchy’s theorem, that number is the difference between 
the number of zeros and the number of poles of G(s) in the right-half 
s plane. Since G(s) evidently has no poles in the s plane, the number 
of revolutions of G(s) is the number of zeros. Hence, for stability, the 
vector G(s) must not make any complete revolutions as s traces the 
specified curve. Therefore the stability question can be answered by 
plotting the Nyquist diagram. 

A direct application of this method to G(s) as given by Eq. (8.12), 
however, is inconvenient because of the complication caused by lag 
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term e~*, M. Satche! proposed a very elegant and ingenious method to 
treat such a system with time lag: Instead of treating G(s), break it into 
two parts, 


G(s) = gi(s) — ge(s) (8.13) 
where 

gi(s) = e= 

g(s) = — = z - (8.14) 


The vector G(s) is thus a vector with its vertex in gi(s) and its tail on 
go(s). The graph of gi(s) for s on the imaginary axis is the unit circle. 
For s on the large half circle, g(s) is 
within the unit circle. The graph 
of g2(s) is the straight line (Fig. 8.1) 
parallel to the imaginary axis when 
sis on the imaginary axis. When s 
is on the large half circle, go(s) is a 
half of a great circle closing the curve 
on the left. A moment’s reflection 
will show that in order for the vector 
G(s) not to make complete revolu- 
tions for any value of the time lag ô, Fro. 8.2 

the g(s) curve must le completely 

outside the gi(s) curve. That is, for unconditional intrinsic stability, 


l-n 


7 >1 or ¢>n>0 (8.15) 


It is easily seen now that the separation of G(s) into two parts gı(s) and 
go(s) allows a great simplification of the respective curves. The diagram 
of the combination gi(s) and ge(s) will be called the Satche diagram. 

When n > 4, the gi(s) and g(s) curves intersect. Stability is still 
possible, however, if for ge(s) within the unit circle of Fig. 8.2 gi(s) is 
to the right of g(s). This condition is satisfied if 


mina => = 


n 


ô < 6* 


sos (— = . i 
EAE |z s (: : »)| (8.16) 


y 2n — 1 2n — 1 


1 M. Satche, J. Appl. Mechanics (ASME), 16, 419-420 (1949). 
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When ô = 4*, then with 
w* = y 2n — 1 (8.17) 


G(iw*) = 0. Therefore, when ô = 6*, has an oscillatory solution with 
the frequency w*. 6* and w* are thus the nondimensional critical time 
lag and the nondimensional critical frequency, respectively. 

8.3 System Dynamics of a Rocket Motor with Feedback Servo. 
Consider now a system including the propellant feed and a feedback 
servo, represented by Fig. 8.3. In order to approximate the elasticity of 
the feed line, a spring-load capacitance is put at the point midway 
between the propellant pump and the injector. Near the injector there 


Line Capacitance 


Rocket Motor 


Fie. 8.3 


is another capacitance controlled by the servo. The servo receives its 
signal from the chamber-pressure pickup through an amplifier. If the 
feed system and the motor design are fixed by the designer, the question 
is whether it is possible to design an appropriate amplifier so that the 
whole system will be stable. Because there is no accurate information 
on the time lag of combustion, a practical design should specify uncon- 
ditional stability, i.e., stability for any value of time lag ô. 

Let mo be the instantaneous mass flow rate out of the propellant pump 
and po be the instantaneous pressure at the outlet of the pump. The 
average flow rate must be m. The average pressure is fo. The pump 
characteristics can be represented by the following equation: 

Po — Po Mo — M 

Ea mea (8.18) 
If the time rate of change of mass flow is small in comparison with speed 
of propagation of elastic waves in the liquid, but large in comparison 
with the slow rate of change of the rotating speed of the pump, a is 
simply the slope of the head-volume curve of the pump at constant speed 
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near the steady-state operating point. For conventional centrifugal 
pumps, a is approximately 1. For displacement pumps, «æ is very large. 
For the constant-pressure pump, or the simple pressure feed, œ is zero. 

Let mı be the instantaneous mass rate of flow after the spring-loaded 
capacitance, x be the spring constant of the capacitance, and p, the 
instantaneous pressure at the capacitance. Then 


. _ dp 
Mo ~ Ma =P (8.19) 
where p is the density of the propellant, a constant. 
In the following calculation, the pressure drop in the line caused by 
frictional forces will be neglected. Then the pressure difference py — pı 
is due to the acceleration of the flow only. That is, 


L dmo 


Po — Pi = oa Gp (8.20) 


where A, a constant, is the cross-sectional area of the feed line, and 1 
is the total length of the feed line. Similarly, if pə is the instantaneous 
pressure at the control capacitance, 


= l din, 
Pi — P = 57 ae (8-21) 
If the mass capacity of the control capacitance is C, then 
l , dC 
iy = M= (8.22) 


Since the control capacitance is very close to the injector, the inertia 
of the mass of propellant between the control capacitance and the 
injector is negligible. Then 


1 me 


where A; is the effective orifice area of the injector. 4A; can be eliminated 
from the calculation by noting that at steady state the difference of 
pressures fio and #, or Af, is 


Se. ca teil 

aa ae a ye (8.24) 

Equations (8.18) to (8.24) describe the dynamics of the feed system. 

By a straightforward process of elimination of variables, a relation 

between m,, p, and C is obtained. To express this relation in nondimen- 
sional form, the following quantities are introduced: 

ee _ 2AP eg. 
P=aip = = fe, J = pA, (8.25) 
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and 


k = 1 (8.26) 
mo, 


where 8, the gas transit time, is given by Eq. (8.9). Then the non- 
dimensional equation relating ¢, u, and «x is 


2 
Pliteme tps tries} et ator +ay 


2 
+ laE (P +4) +S + BP +h REG + EVES e 


2 d’ 
yla P+D +I HiP tD t T =0 (8.27) 


where z is the nondimensional time variable defined by Eq. (8.9). 

The dynamics of the servo control is specified by the composite of the 
instrument characteristics of the pressure pickup, the response of the 
amplifier, and the properties of the servo. Since we do not propose to 
discuss the detailed design of the feedback servo, the over-all dynamics 
of the servo control is represented by the following operator equation: 


d 
F a p =K (8.28) 


where F is the ratio of two polynomials with the denominator of higher 
order than the numerator. 

Equations (8.7), (8.27), and (8.28) are the three equations for the 
three variables y, u, and «. Since they are equations with constant 
coefficients, the appropriate forms for the variables are 


= aes? u = be* k = le” (8.29) 


By substituting Eq. (8.29) into Eqs. (8.7), (8.27), and (8.28), three 
homogeneous equations for a, b, and c are obtained. Thus we have 
als + (1 — n) + ne] — be = 0 
ae + of (P +4) s+ 4 JEst} a+ fu ta(P +h] 
+ [of (P +4) +J]s +B oJE (P +4) +4 JB) +3 Esl b 
sfa (P +4) + Js +} aJ E (P +4) è +4 Es} c =0 
F(sja-—c=0 


In order for a, b, and c to be nonzero, the determinant formed by their 
coefficients must vanish. This condition can be written as follows: 
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[s+ —n)] fee +$JE[l +a (P +3) 
$B P+) +I s+ tah +a 
+e f nJ*Es? + {}nJE (1 + a (P +p] +4 JEP} s* 
+ {n [aE (P +4) +J] +aEP (P +H} 
tfn + aP +H] Pl 


+ sF (s) fi PEs +3aJE(P +4) +Js+a(P + D} =0 
(8.30) 


This is the equation for determining the exponent s. F(s) is now recog- 
nized as the over-all transfer function of the feedback link. The com- 
plete system stability depends upon whether Eq. (8.30) gives roots that 
have positive real parts. 

8.4 Instability without Feedback Servo. The system characteristics 
without the feedback servo can be simply obtained from the basic equa- 
tion (8.30) by setting F(s) = 0. Let it be assumed that the polynomial 
multiplied into e: has no zero in the positive-half s plane, as is usually 
the case. Then Eq. (8.80) can be divided by that polynomial without 
introducing poles in the positive-half s plane into the resultant function. 
That is, for the Satche diagram, one has again 


G(s) = gi(s) — g(s) gals) = e* 
gi(s) is thus again the “unit circle.” ge(s) is now much more complicated: 


n) = ($445 


r) (PES +4 JE l +a(P HIL 
+ [ol (P +3) +s +i +a (PHD 
+ E J?Es? — 4 JE f1 +a (P +4) 4 (P/n)} s? 


+ {al (P + 4)f1 + (P/n)] + J} s+ fı ah pa) + (P/n)} 
(8.31) 


The intercept of go(s) when s is purely imaginary is given by setting 
s = 0 in Eq. (8.31), t.e., 


l-n 1+a(P +3) 


g2(0) = — “n ah en) (8.32) 


104 ENGINEERING CYBERNETICS 


Since all the parameters n, a, and P are positive, the magnitude of 
g2(0) is now smaller than the magnitude of g2(0) given by Eq. (8.14) 
for the intrinsic stability problem. Thus the effect of the feed system 
is to move the g(s) curve towards the unit circle of gi(s) in the Satche 
diagram. For instance, for n = 4, ge(s) is Just tangent to the unit 
circle for the intrinsic system without considering the propellant feed. 
But with the propellant feed system, the g(s) curve will intersect the 
unit circle, and the system will become unstable for time lag 6 exceeding 
a certain finite value. The influence of the feed system is thus always 
destabilizing. This is further confirmed by considering the asymptote 
of ge(s) for large imaginary s, obtained from Eq. (8.31). That is, 


i To l-n 2P 

go(tw) = — É F (= — 7) oF l for w| > 1 (8.33) 
Therefore, for large imaginary s, ga(s) approaches asymptotically a line 
parallel to the imaginary axis at a distance 


n Jn? 


to the left of the imaginary axis. The effect of feed system is again to 
move g(s) towards the unit circle. 

It is thus evident that for the parameter n near = or larger than 4, it 
would be impossible to design the system for unconditional stability. 
In the Satche diagram, the gi(s) and g(s) curves will always intersect 
without a feedback servo. 

8.5 Complete Stability with Feedback Servo. Ifthe polynomial H (s), 


H(s) = 3 J°Es' + {2 JE [1 + a (P +3)] + VEP/2n)} $ 
+ {aE (P +4) + (HPP + 4)}s + {1 +a (P +3) + Pm)| 
+ [sF(s)/nl{ 4 JES + }aJE (P +3) H Js +a (P+ A} (8.34) 


which multiplies into eè in Eq. (8.30) has no poles or zeros in the right- 
half s plane, then the presence of zeros of the expression in Eq. (8.30) 
in the right-half s plane can be determined from the Satche diagram with 


gi{s) = e* 


g(s) = -(£+1=*) [ves + 4780 +a(P+))] 8? 


+B (P+) + e+ tah +H +H) (838) 
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As s traces the path of Fig. 4.4, gi(s) is again a unit circle. Therefore, 
if simultaneously the go(s) curve is completely outside the unit circle, 
there can be no root of Eq. (8.30) in the nght-half s plane. In other 
words, if the transfer function F(s) of the servo-control link is so designed 
as to place the go(s) curve completely outside the unit circle (Fig. 8.4), 
then the system is stabilized for all time lags. 


| 
| 
| 
l-n l+a(P+4) 


lto(P+4)+£ 


2P _1- 
dn= n 


As an example, take 
n=; P=% J=4 H=4 c=l 


This value of a corresponds to a centrifugal pump for the propellant. 
Then without the servo control, go(s) is 


ie _ 1 Qs + IQs? + 3s? + 9s + 6) 
BO) E EBS? + 68 + 6 


Of primary interest is the behavior of g(s) when s is a purely imaginary 
number tw, for w real. Thus 


Shy a e + 4w) (6 — 3w?) + w?(21 — 8w*)(6 — w?) 
ge(ta) = — (6 — 3w + (6 — w?)? 
‘oy 2! — 8w’) (6 — 3w?) — (6 — 21w? + 404) (6 — w?) 
(6 — 3w7)? + (6 — w?)? 


This curve for w > 0 is plotted in Fig. 8.5. It is evident that for suff- 
ciently large values of time lag, the system will be unstable. On the 
other hand, if the go(s) curve can be changed by the servo control to, say, 


_g is + 2)(s + 3) 


gals) = (s + 6) 
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then, as plotted in Fig. 8.5, the new g curve is completely outside the 
unit circle of gi(s). Therefore the system is now unconditionally stable. 
A straightforward calculation from Eqs. (8.31) and (8.35) shows that the 
required transfer function F(s) for the servo link is 


(s + 1.0528) (s? + 0.71648 + 2.6304) 


Pls) = 4.815 FB) (6 +3) (0 + 0.5332) (6? + 0.46688 + 3.7511) 


The servo link has thus the character of an integrating circuit, first 
discussed in Sec. 3.3. If, with given response of the chamber-pressure 
pickup and of the servo for the control capacitance, an amplifier could be 
designed to give an over-all transfer function close to that specified above, 
the combustion could be stabilized by such a servo control. 

As the second example, take 


n=} P=% J=4 HE=4 a=0 


Since a = 0, the feed pressure po is thus constant and even when the flow 
of propellant varies. The case then corresponds to that of a simple pres- 
sure feed. Without the feedback servo, 


oe _— 1@s + 1s? + s + 8s + 2) 
IAS) 9 8 4 29? + ds $4 


When s is purely imaginary, 


»\ 1 (4 — 2w?)(2 — 17u? + 404) + w(t — w?)(12 — tw?) 
me) = E T T 
1. (4 — 2w*)(12 — 4w?) — (4 — a?) (2 — 17w? + 44) 
Page (4 — 2%)? + wo — w)? ~ 


This curve of g: is plotted in Fig. 8.6. It is evident that without servo- 
control the combustion will be unstable for sufficiently long time lag. 
In fact, the system is even less stable than the system considered in the 
first example: it will become unstable at shorter time lag. The part of 
the gz curve near w = 2 is of special interest. Near w = 2, the curve 
comes so close to the unit circle of gı that if the value of time lag ô is 
such as to make gı and gz for w ~ 2 very close to each other, then an 
almost undamped oscillation at w ~ 2 can occur. This critical value of 
ô is evidently smaller than the critical ô determined from the true inter- 
section of g, with the unit circle at w ~ 0.65. 

For unconditional stability, gə should be displaced out of the unit 
circle, to, say, the same “stable” curve as in the first example. The 
required transfer function F(s) is calculated to be 


(s + 0.8126) (s? — 0.04337s + 2.6506) 


F(s) = —4.875 s*(s + 2)(s + 8)(s? + 4) 
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The required servo link must then have the character of a double inte- 
grating circuit. Furthermore, the transfer function has two purely 
imaginary poles at £22. This unrealistic requirement on the amplifier 
comes from the original feed-system dynamics and is due to the neglec- 
tion of frictional damping in the feed line. In any actual system, the 
frictional e in the feed line will remove these purely imaginary 


TENESSE 
aea 


Peet 
NARF IERES 


REN TRENI 
oe, 


LRE Ea 
i dA 
a 

Lice ene 


Fic. 8.6 


poles of the required transfer function F(s) and replace them by two 
complex conjugate poles. 

It should be emphasized that the advantage of using a feedback servo 
to stabilize the combustion lies in its great flexibility in being able to 
obtain unconditional stability with any value of time lag ô or r. In as 
much as there are no accurate data on the time lag, the possibility of 
unconditional stability is of very real, practical importance. Even more 
than this, servo stabilization also makes it possible to design the system 
for stability against any expected variation in the parameter n. From 
physical reasoning, n probably takes a value between 3 and 1. Take 
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the worst possibility, and design the system for unconditional stability 
with n = 1. Then the system will be stable for all expected values of n. 
Therefore stabilization by feedback servo can be assured without having 
to know the exact values of the parameters of the system. 

8.6 General Stability Criteria for Time-lag Systems. In the preced- 
ing discussion of servo stabilization, it is assumed that the polynomial 
H(s), of Eq. (8.84), has no pole or zero in the right-half s plane. This, 
however, is not necessarily the case. In general then, one should first 
investigate the number of zeros and poles of H(s) in the right-half s plane. 
To do this, it should be recognized that the polynomial in Eq. (8.34) 
before the factor F(s) usually does not have zeros in the right-half s plane. 
Therefore, instead of studying H(s), one can study the ratio of H(s) to 
that polynomial. That is, the number of zeros and poles of H(s) in the 
right-half s plane is the same as the number of zeros and poles of the 
following function: 


H(s) + (aE + {JEU +a(P +2] + (JEP /2n)} s? + { oH (P + 4) 
+ (OBP/n)(P +8} s + {1+ oP +H) 
+ (PY =1+L() (8.36) 


where 
L(s) = = sF(s) [ze + daJE(P +3) 8° + Js + a (P + D| 
+ [args + {WA + a(P +4)] + JEP/2n)} sè + faE(P + 4) 
+ (aEP/n)(P +4)} s+ {1 +alP +4) + Pin} (8.37) 


According to the Nyquist criterion, the number of poles and zeros for 
1 + L(s) in the right-half s plane can be found by plotting the Nyquist 
diagram of 1 + L(s) with s tracing the curve of Fig. 4.4. In fact, 
if 1 + L(s) or H(s) has r zeros and q poles in the right-half s plane, then 
L(s) will carry out r — q clockwise revolutions around the point —1 as s 
traces the semicircle. Hence the necessary information on H(s) can be 
obtained by plotting the Nyquist diagram of L(s). 

When one divides Eq. (8.30) by H(s) in order to get gi(s) and g2(s) 
as given by Eq. (8.35), q zeros and r poles are introduced in the right- 
half s plane. The g poles of L(s) must come from F(s), since the poly- 
nomial in the denominator of Eq. (8.37) has no zero in the right-half 
s plane. Therefore the original expression in Eq. (8.30) also has g poles 
in the right-half s plane. Hence in order for the original expression in 
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Eq. (8.380) to have no zero in the right-half s plane, go(s) must make 
—q + (q — r) = —r clockwise revolutions around the unit circle. In 
order for the stability to be unconditional, t.e., stable for all time lag, the 
go(s) curve should never intersect the unit circle. Therefore the general 
unconditional stability criteria are, first, the go(s) curve must lie completely 
outside the unit circle; and, second, g(s) must make r counterclockwise 
revolutions around the unit circle as s traces the conventional path enclosing 
the righi-half s plane. These are the criteria for stability with the Satche 


7 (Full Curve for Positive w; 
€ Dotted Curve for Negative w) 


(a) 


Nyquist Diagram for L (s) 
With Two Zeros for 1+L (s) 
(b) in Right Half s -plane 


Stable Satche Diagram 
Fia. 8.7 


diagram. To determine r, one has to use the Nyquist diagram of L(s), 
Eq. (8.37). Thus the stability problem for the general case requires 
both the Satche diagram and the Nyquist diagram (Fig. 8.7). 

It is evident that the stability criteria developed here using a combina- 
tion of the Satche diagram and the Nyquist diagram are applicable to 
any system with a time lag r. The stability of such systems always 
reduces to the question of ascertaining whether there is root of 


M(s) = 0 


with a positive real part, where M(s) contains terms with the factor 
es. The principle of the method, as seen from the previous discussions, 
is to divide M(s) by the coefficient of e~: in M (s), such that 


ize = Glo) = pl) = ox) 


and 
gi(s) = e~ 

The curves of gi(s) and go(s) as s traces the right-half semicircle shown 

in Fig. 4.4 then constitute the Satche diagram, with gi(s) represented 
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by the unit circle. To find out whether dividing M(s) by 1 + L(s) has 
introduced roots with positive real parts to the Satche diagram, we have 
to plot the Nyquist diagram of Z(s). The number of roots of M(s) = 0 
with positive real parts can then be determined by Cauchy’s theorem. 

The go(s) function contains the transfer function of the feedback link 
with an amplifier which is under the designer’s control. g(s) may also 
contain transcendental functions of s coming from other parts of the 
system. Since the transfer function of the amplifier in the feedback 
link is generally a ratio of two polynomials in s, it is difficult to compen- 
sate completely for the destabilizing effects of a transcendental function. 
However, in the Satche diagram, the critical part of the gə(s) path is 
that part close to the unit circle of gi(s). But g(s) close to the unit 
circle is generally obtained by small values of s. Hence for the critical 
part of go(s), the transcendental function can be expanded into a power 
series in s. The amplifier of the feedback link can then be designed 
using a few terms of the power series as an approximation. Thus the 
amplifier compensates the destabilizing effects of the system in the critical 
region. Of course, the performance of the system should be checked 
finally by the stability criteria developed, using the amplifier design 
characteristics. This is the procedure suggested by F. E. Marble. For 
details, the original work! should be consulted. 


1 F. E. Marble and D. W. Cox, J. Am. Rocket Soc., 28, 75-81 (1953). 


CHAPTER 9 


LINEAR SYSTEMS WITH STATIONARY 
RANDOM INPUTS 


In the previous chapters, the inputs to a system are considered to be 
definitely specified functions of time t. However, there are many engi- 
neering problems for linear systems with constant coefficients where the 
inputs cannot be so definitely described. An example of such an engi- 
neering problem is the problem of the motion and the stresses induced 
in the structure of an airplane wing in a turbulent air stream. Here the 
input can be considered to be the time-varying air-flow pattern. The 
air-flow pattern cannot be described as a definite function of time but 
has to be recognized as a random function of time, specified by certain 
statistical characteristics. It is then evident that the output of the 
system, the stresses in this case, must also be a random function and 
can also be described only in statistical terms. The first objective of 
this chapter is then to find a convenient method of calculating the statis- 
tical properties of the output from the specified statistical properties of 
the input. ‘This forms an easy extension of the early investigations by 
P. Langevin of Brownian motion. 

Another example of random input is the so-called noise in control 
signals. The noise is introduced by disturbances and fluctuations beyond 
the control of the designer. The problem of noise is a subject of much 
research in connection with communications engineering. There, the 
central question is how to design the system so that the effects of the 
unavoidable noise can be minimized and the useful information of the 
signal not destroyed. We shall discuss this particular problem of noise 
filtering in Chap. 16. The problem of this chapter is, however, somewhat 
different. In our present problem, the random output is the only output 
of the system. Our purpose in the design of the system, particularly the 
design of the feedback servomechanism, is to obtain with a given input 
an output of the desired statistical characteristics. We shall see that 
the transfer-function method developed in the previous chapters remains 
useful in the present task. 

9.1 Statistical Description of a Random Function. Let us consider a 
system which generates a random function y(t). Now to formulate 
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the concept of a statistical description of such a random function, we 
have to consider a great number of systems identical to the first. Such 
a group of systems is called an assembly. The random functions gen- 
erated by the members of the assembly are ¥1(¢), y2(t), ys(t), . . . . The 
random character of the function is exhibited by the fact that, although 
the systems are identical, the value of the function generated by any 
member of the assembly at any specified instant of time ¢ is generally 
different from the value generated by another member at the same time 
instant. But we can ask for what fraction of the total number of the 
systems y occurs in a given range y to y+dy. This fraction will 
depend on y and t and will be proportional to dy when dy is small. This 
fraction is the probability that y will lie between y and y + dy at time ż. 
It is written as Wi(y,t) dy. The function Wy(y,t) is called the first 
probability distribution. Next we can consider all the pairs of values of y 
occurring at two given instants ¢; and ft. The fraction of the total num- 
ber of pairs in which y occurs in the range yı to yı + dyı at tı and in the 
range yz to yo + dy: at tz is written as We(yi,tijy2,t2) dyı dyz The func- 
tion Wo(y1,t1;y2,t2) is called the second probability distribution. Higher 
probability distributions can be similarly constructed. 

The fact that the above formulation of the statistical description of 
the random function depends upon observations carried out simul- 
taneously on a very large number of identical systems may be objection- 
able on the ground of practical difficulty in observation. However, if the 
random function is a stationary random function in the sense that all 
statistical characteristics of the function are time independent, then the 
large assembly of identical systems is not necessary—all necessary obser- 
vations can be made on a single system, for a very long period of time. 
The record of observation can then be cut into pieces of length 0, O being 
large in comparison with the characteristic time of the function. Then 
each piece contains the same statistical information about the behavior 
of the system, since statistically the origin of the time scale is of no conse- 
quence. The different pieces can then be considered as an assembly of 
observations on identical systems, and the various probability distribu- 
tions can be determined. Furthermore, these distributions now become 
somewhat simpler: W, will be independent of time t, and W will be 
dependent only upon the time interval7 = t — tı Hence for stationary 
random functions, W1(y) dy is the probability of finding y between y and 
y + dy; We(yr,y2;7) dy; dy: is the probability of finding a pair of values 
between yı and yı + dy, and between yz and y2 + dy: at an interval of 
time equal to r. Since the random functions of engineering problems 
can very often be considered as stationary random functions, we shall 
limit the following discussion to such random functions. 

It should be emphasized that these probability distributions embody 
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all the information about the statistical properties of a random function. 
Or we may say that the probability distributions “define” the random 
function. Of course, these distributions W, are not arbitrary but must 
satisfy the following conditions: 
(a) Wa = 0, because there is no such thing as negative probability; 
(b) Wn is symmetric with respect to its variables y,, t.e., 


Weolyi,yast) = Wolyo,yr5r) (9.1) 


This is obvious from the meaning of W» as a joint probability distribution. 
(c) Higher distributions imply lower distributions. That is, 


J. Wun) dye = Win) = Wily) (9.2) 


where the integration is to be carried over all possible values of yz. Note 
that integration over y, also eliminates 7. Furthermore, 


in Wily) dy =1 (9.3) 


This simply means that the probability of all occurrences must be a 
certainty. 

9.2 Average Values. From the first probability distribution W,(y), 
the average value Ņ of y can be found: 


g = |, yW) dy (9.4) 


Since we have limited ourselves to stationary random functions, the 
average value can be also obtained by taking the time average of y(t). 
That is, 


1 [e 
j = lim 5 J, y(t) dt (9.5) 


The equality of the assembly average, Eq. (9.4), and the time average, 
Eq. (9.5), is a characteristic of stationary random functions. We shall 
utilize this property repeatedly in the following calculations. 

Equation (9.4) can be generalized to arbitrary powers of y. Thus 


ma = y= |” W) dy (9.6) 


m, is called the nth moment of the first probability distribution. From 
the first and second moments, we can compute what is called the fluctua- 
tion, variance, or mean deviation o: 


= Y= = |, U- HW) dy 
=|" w-2v+OIM@dy=F-@ (9.7) 
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¢ is thus a measure of the “width” of the probability distribution W1(y) 
about the average value 7. Similarly, the third moment gives a measure 
of the skewness of the probability distribution. More and more informa- 
tion about Wi(y) can be deduced as more moments are known. For 
some cases, the knowledge of moments uniquely determines the distribu- 
tion. For instance, if 


Moisi = 0 fork 0 a 12pcs 
my = 1:3:5- ++ (2k — 1)o” (9.8) 


then the first probability distribution W1(y) is the well-known Gaussian 
distribution, or normal distribution, 


_ 1 —y 2/202 

Wily) a ./ On € (9.9) 

Sometimes it is convenient to choose the origin of the y coordinate 
in such a way that 7 vanishes, z.e., the origin is the average value of y. 
When this is done, we say the probability distribution is normalized. 
In this case, the square of the mean deviation is simply the second 
moment y?, as seen from Eq. (9.7). 

The most important average value derived from the second probability 
distribution W2(y1,y2;7) 1s the correlation function R(r). It is defined as 


Yyy = yy + 7) 
L Jo Y1Yy2W2(Y1,y2;r) dyı dyz (9.10) 


R(T) 


For a stationary random function, this is clearly also obtainable from 
time averaging: 
TE E 
R(r) = lim a y(t)y(t + 7) dt (9.11) 
Q@—> 00 -0/2 
The function Æ(r) thus gives a measure of the interrelation of the y’s 
measured at two different time instants. It is to be expected that as 
the time interval 7 increases, the interrelationship or “memory” must 
be weakened and, in the end, when r is very large, y(t) and y(t + 7) 
will be independent of each other. Then according to the rule of prob- 
ability calculus, the second probability distribution is equal to the 
product of Wi(yi1) and Wi(y2). Thus for large r, 


RG) = |À, fZ, yyWiy) Wigs) dyn dys = (g)? (812) 
For 7 = 0, it is obvious from Eq. (9.11) that 
R(0) = 4? (9.13) 
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Now since R(r) can be calculated with an arbitrary shift of the origin 
of the time scale, 
R(t) = yyt +7) = yt — ryk) 


If we differentiate the equation with respect to 7 and then put r = 0, 


R'(0) = yOy'® = —yOy'@ 


R'(0) = y@y’® = 0 (9.14) 
In these equations, a prime indicates differentiation with respect to time. 
Thus the correlation of a random function with its derivative at the 
same time instant is zero. This means that the slope of the record of y 
at any y has an equal probability of being negative or positive. 
If we differentiate Eq. (9.14) twice with respect to 7 and then set 
r = 0, we have 


Therefore, 


B'(0) = yy À = -y? (9.15) 

This equation allows us to calculate the mean square of the derivative 

of y from the correlation function. Similarly, the mean square of the 
second derivative of y can be calculated as 

R” (0) = y” (9.16) 


9.3 Power Spectrum. Of special significance for our applications of 
the theory of random functions is the notion of the spectrum of a random 
function. Let us suppose that a function y(t) is observed for a long time 
©. If y() is assumed to vanish outside the interval, then y(¢) can be 
developed into a Fourier integral! 


y(t) = |7, Aloe de (9.17) 


where A(w) is the amplitude of the frequency w. It can be computed 
from y(t) by the inversion formula: 


1 fe? 
Alw) == i / y(t)e di (9.18) 
2 


2a —6 


If we denote A*(w) as the complex conjugate of A(w), then, since y(t) is 
real, Eq. (9.18) indicates that 


A*(w) = A(—w) (9.19) 
Now we can calculate the average y? in terms of A (w) as 
1 (9? 
= lim = | y?(t) dt 
Ə J-e/2 


l 8/2 © 20 
= lim = | dt | | dw dw’ A(w) A (weete 
@—> a O — 6/2 — ojo 
1§ee for instance Whittaker and Watson, “ Modern Analysis,” Sec. 9.7, p. 188, 
Cambridge-Macmillan, 1943. 
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By the substitution w” = —w’, we have 


= lim 4 aa J, dw do” A(w)A*(w’’) D eie—e”)t dt 
e+ o O 
as E 9 J A(w) A*(w n iee do des!" 
O- w Om ay! 


If we now introduce the new variable ¢, defined as 


then 


Consequently, 


eee Oe a * sin & 
= im 2 | aoda f y k 


= 4r lim sf |A(w) |? dw 
Q~ a% 0 0 
Therefore, if we put 
S(u) = lim = |A(w)|? (9.20) 
6 œ 
then 
-5 = (z s] 
P= f Blu) do (9.21) 
The function &(w) is thus a real function and is called the power spectrum 
of the random function. Equations (9.20) and (9.21) enable us to com- 
pute the average value y° from the Fourier coefficient A (w). This rela- 
tion is the Parseval theorem. 


Let us consider next the correlation function R(r). By combining 
Eqs. (9.11) and (9.17), we have 


il 6/2 
R(r) = lim ate, y(t)y(t + 7) dt 
e- v 9 —0/2 
0/2 
= lim = 9 J. A (w) A (w) dw dw’ | todt dt 
— 06/2 


Then, by an argument similar to the above, we have 


R(r) = i * P(w) cos wr dw (9.22) 
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By setting 7 = 0, Eqs. (9.21) and (9.22) reduce to the relation of Eq. 
(9.13). By differentiating Eq. (9.22) with respect to 7 and then setting 
7 =0, Eq. (9.15) is obtained. According to the Fourier inversion 
theorem, 


lw) = : I R(T) cos wr dr (9.23) 


Equations (9.22) and (9.23) allow the computation of either the correla- 
tion function or the power spectrum when one of these is known and are 
called the Wiener-Khintchine relations. 

The power spectrum (w) may contain peaks of the Dirac 6-function 
type. This certainly is the case when 9 is not zero, or, in the terminology 
of electrical engineering, when there is a d-c term. Then 


lw) = 20) lw) + Pil) (9.24) 
where 6(z) is defined as 
6(—zx) = 6(x) = 0 for z #0 
ôlz) > œ% forz = 0 (9.25) 


such that i ô(x) dx = 1 and i ” ô(z) dx = 5 


For pure “noise,” the peak at w = 0, corresponding to the d-c term, 
will usually be the only peak, so that (w) will be a regular function, 
representing the really continuous spectrum. But it is also possible to 
have several sinusoidal oscillations superimposed upon the noise. In that 
case, the power spectrum will have additional peaks at the discrete 
frequencies of these oscillations. 

9.4 Examples of the Power Spectrum. We shall show a few power 
spectra computed from correlation functions. If the correlation function 
is given by a Gaussian curve, 


R(r) = R0)” (9.26) 


then, according to Eq. (9.23), the corresponding power spectrum is 


P(w) = : R(0) I COS (wre dr = (0) eta) 


where (0) = n 


It is interesting to note that as a— œ the correlation function becomes 
zero for all finite 7, and R(0) — © in such a manner that R(r) becomes 
a ô function. This means that subsequent y’s are not correlated at all 
and that the random function is the “most chaotic” of all. As a— œ, 
the power spectrum is a constant, independent of the frequency. This 


(9.27) 
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most chaotic random function is called the white noise and often describes 
the naturally generated random variations. 

Another example is the small isotropic turbulence in a fluid flow of 
otherwise uniform velocity. It has been shown by von Kármán and 
Howarth! that the fundamental second-order correlation functions are 

Ri(r) and R2(7): Ri(r) is the correla- 


aaa * >, Jlr) tion function of the fluctuating ve- 
i locity parallel to the mean flow direc- 

t tion at the same space point, but 

ENE at time interval r apart (Fig. 9.1). 

Fre. 9.1 R,(r) is the correlation function of the 


fluctuating velocity normal to the 
mean flow direction. If U is the mean velocity and Lisa scale of turbu- 
lence, these correlation functions can be approximately expressed as 


R(t) = R0) (9.28) 
l rU 
Rafe) = Rae" (1-37) (9.29) 
By using Eq. (9.23), the power spectra ®i(w) and 2(w) for velocity 


fluctuations parallel and normal to the mean flow direction, respectively, 
are 


a(o) = 0,(0) HEELT) (931) 


[hr (L/U) 
where $,(0) and #.(0) are the values of the power spectra at w = 0. 
They are related to Rı(0) and R2(0) by 


6,(0) = 25 R,(0) | 
k (9.32) 
8,0) = =F BO), 


9.5 Direct Calculation of the Power Spectrum. It is not necessary, 
of course, to calculate the power spectrum from the correlation function. 
Sometimes it is possible to determine the spectrum directly from the 
specified character of the random function y(t) itself. Let us consider, 
for example, the case where y(t) consists of a series of pulses that have 
identical shape and a constant repetition frequency but whose heights 
vary according to some probability distribution. The heights of succes- 
sive pulses are, however, uncorrelated. This is shown in Fig. 9.2 for a 
rectangular pulse. If y(¢) represents a single pulse of unit height, then 


y(t) = Y a(t — kD) (9.33) 


k 
l von Kármán and Howarth, Proc. Roy. Soc. (A), 164, 192 (1938). 
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where T is the spacing of the pulses and a, the height of the kth pulse. 
The first step in computing the power spectrum is to determine the 
Fourier spectrum A(w) by Eq. (9.18). Let 6 = 2NT, then 


NT 
foe N yei di = Z 9, on (t — kT) di 
Or NT 
N 
= > Qe TET a | n()eé dé = alw) y a,e7 ek 
2r sae 
N —N 


where a(w) is the Fourier spectrum of the single pulse, 


a(w) = L i n(éje*#é dé (9.34) 
Py ae me 
For a rectangular pulse of width 2e and unit height, 
aye [fe ' et ge = 1 sin oe (9.35) 
20 T W 


According to Eqs. (9.19) and (9.20), the power spectrum is 


lw) = 4r me w)/? lim SN D asaaeiea or | (9.36) 


To carry out the limiting process in Eq. (9.36), it will be convenient first 
to make an assembly average of the whole equation. Since the power 


a2 
as 
ay ay as 
ft 
SE SEE SN ee see ae eee 
Fra. 9.2 


spectrum (w) is the same for every member of the assembly, the left 
side of Eq. (9.36) is not changed by averaging over the assembly. The 
right side of Eq. (9.36) will be simplified by such an averaging process. 
Let @ be the average of a; and a, and a? the average of the square of 
a and a. Then 


arm = (ar — a) (a — &) + alla — a) + (a — a] + (0)? 


We substitute this expression into the right side of Eq. (9.86) and then 
average over the assembly. Then since the successive heights of the 
pulses are not correlated, the assembly average of (a, — @)(a: — @) is 
zero, unless k = 1. When k = l, the assembly average of (a, — @) (a; — @) 
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is a? — (a). Thus the first term under the limit sign is 


. N N 
1 5 
lim = z D (a, — a (u — de dT = qg? — (ã)? 
Naa 2N 
-N N 


The assembly averages of (a, — ā) and (a, — &) are obviously zero. 
Therefore, finally, 


N 
B(w) = a lac) |? G — (ā&)°] + (a)? lim a Zz J | (9.37) 
-N 


The sum in Eq. (9.37) will be 2N + 1 for w = 2nr/T when n is an 
integer. Hence, in the limit, the second term of Eq. (9.37) will be 
infinite. For other values of w the sum will be finite, and for N — œ 
the limiting value will be zero. Clearly then, in the limit, the second 
term has the character of a series of peaks, or ô functions, at the fre- 
quency 2nr/T. To determine the coefficient of these ô functions, 
we have to calculate the area under the curve for the typical interval 
—r <wT <a. Thus by integrating the sum, the required area is 


—iwkT 
fai ~ar/T “BN v 5 


Since the area under the ô function, according to Eq. (9.25), is unity, 
the required coefficient is 2r/T. Therefore, finally, the power spectrum 
for the specified stationary random function is 


o1 7/™ 1 — cos 2NoT _ 2r 
= gn Teco TST 


B(w) = 2walalw)|? {T — (0) + (@)% + 2, 0 — Neds )} (9.38) 
where wo is the frequency corresponding to the fundamental period T, i.e., 
Oo = 7 (9.39) 


Hence the power spectrum contains a continuous part that has the same 
shape as the power spectrum of a single pulse. The intensity of this 
continuous spectrum is determined by the square of the mean deviation s 
of the pulse heights. There is, in addition, a discrete spectrum at the 
frequencies nwo, for n an integer, where the intensities are also determined 
by the spectrum of the single pulse. 

Let us consider next a series of pulses that have an identical shape and 
height but a repetition period varying around an average value T. The 
spacing between pulses will be T + e, with e distributed according to a 
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specified probability function P(e). The average value of «is by impli- 
cation zero. The successive e’s will again be assumed to be uncorrelated. 
Figure 9.3 shows such a random function with rectangular pulses. 
Therefore the random function is represented by 


y(t) = X at — kT — &) (9.40) 


k 


where n(t) represents the single pulse. According to Eq. (9.18), with 


o = 2NT, 
N 


Alo) = alu) X etro 


-N 


a(w) is the Fourier spectrum of the single pulse given by Eqs. (9.34) 
and (9.35). The power spectrum of y(t), according to Eqs. (9.19) and 


t 
Lote Pere Peele T+é, oltre T+Eé, n 


Fie. 9.3 
(9.20), is then 


N N 
P(w) = 2 ja(w) |? Jim a D ` etinerng tor | (9.41) 
=N -N 


Now let us introduce the function x(w) defined by 
x(w) = L. P(c)e-** de (9.42) 


x(w) is sometimes called the characteristic function of e and is the Fourier 
transform of P(e). We shall write then 


ge wtgtivn — { [eius — x(w)] + x(w) } {[etiwe — x*(w)] + x*(w) } 


where x*(w) is the complex conjugate of x(w) and is equal to x(—w). 
We now substitute this expanded form into Eq. (9.41) and make an 
assembly average first. The limiting process is greatly simplified by the 
fact that the e’s are not correlated. The final result is 


(w) = Qwola(w)|? fr — |[x(w)|?] + [xlo [Pwo ù i(w — no) } (9.43) 


n=Q 


where w is the frequency defined by Eq. (9.39). Here the shape of the 
continuous spectrum and the intensities of the discrete spectrum are no 
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longer determined solely by the spectrum of the single pulse but depend 
also on the characteristic function of the distribution of «. 

As the third example of direct calculation of the power spectrum, 
consider the stationary random function y(t) indicated in Fig. 9.4. The 
function takes the value of either +1 or —1 with the interval T. T, 
however, is not a constant but is distributed according to a specified 
probability distribution P(T), where T > 0. It is further specified that 
the successive time intervals T are not correlated. Let us denote the 
successive intervals by T, where k = 1, 2,..., and take the time 


Ti T; T; T; To Tii 


0 Tz | Ta Te Ta T 0 


period © of integration in Eq. (9.18) to be from { = 0 tot = NT, where 
T is the average time interval defined by 


Ť = n ° TP(T) dT (9.44) 


Then 
N 


1 NT t l 
A (a) — on i y (tye di er A (— 1)* (eiet — em totk-1) 


where t, is the time instant at the end of the kth interval. The above 
expression can be rewritten as 


N 
Alo) = : > », (=p mee > ( — 1) Veist -+- I 
k=1 


T Ua) 


Hence, by Eq. (9.20), we have the — spectrum as 


(w) =e 4] tim J 7 Y ye pe te | (9.45) 
1k’=1 


Now let us consider k > k’, say k = k’ + m, then 
gio (tates) 2s ee Th 19 io Thre a ee ee Tkm (9.46) 
Since the successive intervals are uncorrelated, the probability of occur- 
rence of the product to the right in Eq. (9.46) is the product of the prob- 


ability of occurrence of each factor. If we introduce the characteristic 
function x(w) of the distribution P(T), 


x(a) = $v) + He) = f° P(e? aT (9.47) 
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then x(w) is the average value of e~“?, Therefore the assembly aver- 
age of the product of Eq. (9.46) is simply [x(w)]". In the double sum 
of Eq. (9.45) there are approximately N such products, all with the 
sign (—1)™. Hence in the limit these products contribute a term 
(—1)”[x(w)]”. m ranges from 1 to œ in the limit. The sum of all such 
terms is then 


Cko = - 3S, 


m=) 
The contribution from terms with k’ > kis just the complex conjugate of 
the contribution from terms with k > k’. These contributions are all 
there is to the double sum of Eq. (9.45) with one exception, the case 
k = k’. These values of k and k’ give a contribution corresponding to 1 
within the square bracket. Hence Eq. (9.45) is finally 


or ee x(w) 
Pw) = To (i 2R Fea cc O 


where Q means the “real part of” the expression. Let the real and the 
imaginary parts of x(w) be o(w) and Y(w), respectively, as shown in 
Eq. (9.47). Then 

4 1— glu) — Pla) 


e) = FaF OOF + FO) e 


If the distribution P(T) is Poisson’s distribution, 
l —T/Ť 
P(T) = 7 € (9.49) 


where T is the average time interval defined by Eq. (9.44), then the 
power spectrum of such a randomly switched function of unit amplitude 
1S 

E san on 
T 1 + (w7'/2)? 


The complete lack of any regular periodicity in the random function 
considered makes the power spectrum continuous and smooth without 
any of the sharp peaks of the previous examples. 

9.6 Probability of Large Deviations from the Mean. If the random 
function is the stress in a structure, then it is not sufficient to know the 
average value of the stress. For safety, we shall want to know the prob- 
ability of occurrence of stresses exceeding the specified working stress of 
the structural material, t.e., the probability P[|y| > k] of the magnitude 


(9.50) 


124 ENGINEERING CYBERNETICS 


of the random function y exceeding the value k. If the first probability 
distribution function W:(y) is known, the answer is very simple: 


Pluk >= [Emod h Wady eD 


But in many engineering problems, the probability distribution func- 
tion is not known. What is known is the mean value 7 and the mean 
deviation o. However, even under these restricted circumstances, it Is 
still possible to give general but broad estimates of the probability of 


w(x) 


Fra. 9.5 


occurrence of large deviations from the mean. For instance, if g(y) is a 
nonnegative function of y, then since W,(y) is by definition nonnegative, 


T= ("ow Wy) y>K [| Wie)dy — (0.52) 


The last integration is to be carried over all y’s satisfying the condition 
gly) > K. But the last integral is just Plg(y) > K]. Thus 


Pig) 2 KI < (9.53) 


This is the so-called Chebyshev inequality. Now let 
gly) = y¥ — 9)? 
Then according to Eq. (9.7), 
gy) =a = 4? — 0)? 
where o is the mean deviation from the mean. Let K = k’o*; then Eq. 
(9.53) gives the Bienaymé-Chebyshev inequality 


Plly — 9| 2 ko] < = (9.54) 

The Bienaymé-Chebyshev inequality is known to be too broad for 
most practical applications, and the upper limit given is, in general, 
much too high. A sharper estimation can be given for W,(y) that has 
only a single maximum, the so-called mode. This estimate for “uni- 
modal” distribution is that due to Gauss. To prove Gauss’s inequal- 
ity, let us consider a function w(x) (Fig. 9.5) which is monotonically 
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decreasing in the range x > 0. It is seen that w(x) can be considered 
as a superposition of functions which are constant from z = 0 tor = 2p, 
and zero for z > 2. Let v(x) = 1 for 0 <a < x, and »(x) = 0 for 
£ > zo Then for any K > to, 


K? i v(x) dx = 0 
But if 0 < K < t, 
K? i vlz) de = K(29 — K) 


The maximum of this quantity for K within the range specified is 372}. 
Thus it is generally true that 


Kk? sa v(x) dz < $ a xv(x) dx 
By superposition, we have 
K? 5 wlz) dz < $ k r w(z) dz 


Now consider any unimodal distribution with the abscissa x = y — Yo, 
where y is the mode. Then 


K? o Wile) dz < $ iF r? W (x) dx 


and 
K? [Tf Wale) de < $ [0 vile) de 


By adding these expressions, 
KP(ly — yo| 2 K] < $’ 
where v is the mean deviation from the mode, defined by 
= Y= we = |, U- yo) Waly) dy (9.55) 
Let K = ky; then we obtain the Gauss inequality 


Pily = ul 2 bl < ge (0.56) 


If the distribution is symmetrical, then yo = ğ, v = c, and Eq. (9.56) 
reduces to 


Pily — J| = ko] < a (9.57) 


Equation (9.57) is a sharper estimate of the probability than Eq. (9.54). 
Often it will be possible to assume, at least approximately, a Gaussian 
distribution. ‘Then, by using the asymptotic expansion of the error 
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function, it is easily shown that 


—}k? 
Pliy — g| > ke] = eee fork» 1 9.58 
[ly — g| = ko] a= > (9.58) 


This is a very small probability. For instance for k = 3, the probability 
is only 0.002. Equation (9.54) will say only that the probability is less 
than 0.1111, while Eq. (9.57) will say that the probability is less than 
0.0493. The difference in these estimates is of course caused by the 
different degree of information available to the estimation. The more 
general the assumption, the broader the estimate. 

9.7 Frequency of Exceeding a Specified Value. If the random func- 
tion is the stress in a structure and if the design is to be based upon the 
repeated occurrences of a certain value of stress, t.e., the “fatigue” stress 
of the material, then it is necessary to know the probable number of 
times per unit time the random function will exceed the value y = &. 
The quantity is evidently one half of the number of times per unit time 
the random function will pass through the value ~ Let No(£) denote 
the number of times of passing. This number was first computed by 
S. O. Rice.t We shall follow his method. 

Let IV (y,y’) dy dy’ be the joint probability of having the random func- 
tion y between y and y + dy and the time derivative y’ between y’ 
and y’ + dy’ at the same time instant. This probability can be also 
interpreted as the fraction of time per unit time that the random function 
y and its derivative y’ will simultaneously have values within the specified 
ranges. But crossing the interval dy takes the time dy/\y’|. Hence 
the expected or probable number of crossings at £ and y’ per unit time 
is equal to the quotient of W(é,y’) dy dy’ and dy/ly’|, or |y’|W(éy’) dy’. 
The number No(£) is obtained by integrating over all y’. Thus 


Nol) = f°, WIWE’) ay! (9.59) 
But Eq. (9.15) shows that for any differentiable random function, y and 
y’ are not correlated. Then according to the general laws of probability 


calculation, W(y,y’) is simply the product of the first probability distri- 
butions Wi(y) and W(y’). Hence Eq. (9.59) can be written as 


Nol) = Wale) f, IWO’) ay (9.60) 
When W(y’) is symmetrical, Eq. (9.60) can be further reduced to 
N(£) = 2W (E) f y Wy’) dy’ for W(y’) symmetrical (9.61) 


1§. O. Rice, Bell System Tech. J., 23, 282 (1944); 25, 46 (1945). 
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If W(y’) is a Gaussian distribution, with the mean deviation g’, then, 
according to Eq. (9.9), 
2WilE) f> tog 20'W (é) 
N,(é) = —— tev ft" dy! = 9.62 
o( £) g’ 4/ On 3 y y 2r ( ) 
The mean deviation o’ can be computed from the power spectrum $(w) 
by using Eqs. (9.15) and (9.22): 


poon h ° Blo) dw (9.63) 


If W.(y) is also a Gaussian distribution with the mean ğ and the mean 
deviation c, then with Eqs. (9.7) and (9.21) we obtain 


Gey) a wlw) do |’ 
2 o 0 


es 64) 
h E) do— (5)? 


l 1 
C O tT # =He 


This is the formula given by Rice. 

9.8 Response of a Linear System to Stationary Random Input. We 
shall finally give the answer to the question we posed in the introduction 
to this chapter: Given the stationary random input to a linear system 
with constant coefficients, what is the output? From the exposition of 
the elements of the theory of random functions in the previous sections, 
it is evident that the key to this question is the calculation of the power 
spectrum of the output from the power spectrum of the input. When 
the power spectrum of the output is known, we can easily compute the 
correlation function by Eq. (9.22), and the mean-square value by Eq. 
(9.21). Then the probability of large deviations from the mean and the 
frequency of exceeding a specified value can be estimated by methods 
given in Secs. 9.6 and 9.7. For many engineering problems, knowledge 
of these characteristics of the output is generally sufficient. 
` Let the input be z(t) with the power spectrum (w) and the correlation 
function R,(r). Then by Eqs. (9.21) and (9.22), we have 


gt = a (w) dw = R,(0) (9.65) 
and 


Rila) = f” Blo) cos ordr =} [7 Sordo (9.66) 


where the relation #(—w) = (w), as seen from Eq. (9.23), is used. 
Similarly, let the output be y(t) with the power spectrum g(w) and the 
correlation function Ro(r). Then 


v= [7 gw) do = ReO) (9.67) 
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and 
Ro(r) = ie glo) dw (9.68) 
As before, let h(t) be the response of the linear system to a unit impulse 
att = 0. For a process started at ¢ = — œ, the output can be written 
as 


y(t) = [i ZORE — 7) dr 


where z(r) dr is the impulse applied at the instant t = r. Since h(t) = 0 
for t < 0, the upper limit of the integral can be extended to +. Then 


y() = [7 OAE- r) dr = | z uhlu) du (9.69) 
The correlation function Ro(r) is thus 
Rolr) = Wu FA = f° h PE- wae Fr w)a(wh(u’) du dw 
But 
at — uet Fru) = 10t Fr Fu wv) = Rir +u- u) 
Hence by using Eqs. (9.66) and (9.68) we have 
[E aoe da = f° f° f° DOE huhlu’) du du! dw (9:70) 


Now if F(s) is the transfer function of the linear system, it is the 
Laplace transform of A(t). Thus [ef. Eq. (8.50)], 


Fie) = f° eh(u) du 
Hence Eq. (9.70) can be now considered as 
[T gee da = f”, BOEG) e de 


Therefore the power spectra g(w) and ®(w) are related by the equation 
gw) = O(a) F(iw)F(—iw) = IF (iw) E(w) (9.71) 


where the fact that F(zw) and F(—tw) are complex conjugates has been 
used. 

Equation (9.71) gives the power spectrum of the output from the power 
spectrum of the input and the frequency response of the linear system. 
Even when the frequency response is given in a graph or in a numerical 
table, the power spectrum g(w) can be easily calculated. The usefulness 
of the concept of transfer function and frequency response is thus again 
demonstrated. It is of interest to note that since F (iw) generally vanishes 
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for w— ©, the output power spectrum g(w) goes to zero for w —> œ 
faster than the input power spectrum (w). This has the effect of 
“smoothing” the output function. 

9.9 Second-order System. As a simple example, consider the linear 
system to be of second order. Then the equation of motion is 


d*y dy = 
m Fr +e Ji + ky = z(t) (9.72) 


The transfer function F(s) of the system is thus 


] 1 I 
ms? +es +k k (s?/w) + 2t(s/w) +1 


where wo is the natural frequency of the undamped system and ¢ is the 
ratio of actual damping to the critical damping, defined by Eq. (3.38). 
Therefore 


F(s) = 


1 
~ -B{[@/eo)® — IP + 4f*(w/e0)?} 


The power spectrum of the output is thus 


_ C 
Ke) = BT oT)? — IF F Poo) 


If we are interested in the mean-square output of the linear system, 
Eq. (9.21) gives 


Ped oee O 
y F [(w/wo)? = 1]? ae At? w/o)? (9.74) 


Now if ¢ is very small, the denominator of the integrand in Eq. (9.74) 
is very nearly zero at w = wo. Therefore, if (w) is a slowly varying 
function, then 


F (iw) F'(—t0) 


(9.73) 


— _ woP(wo) |7 dx xf m _ «© (a) 
es I Gai bape OPO Gime ae O 


This equation shows that if the damping coefficient c vanishes, the 
mean-square output will become infinitely large. When c is zero, the 
transfer function F(s) has the purely imaginary pole tw». This phenom- 
enon of infinite output occurs in general whenever the transfer function 
of a linear system has a purely imaginary pole. Therefore, for satis- 
factory operation under random input, the condition on the system transfer 
function is that all poles of F(s) should have negative real parts. This 
fundamental requirement on the system property is then identical to 
that for conventional input functions. 

Other improvements on the output behavior can generally be accom- 


130 ENGINEERING CYBERNETICS 


plished by further modification of the system transfer function. For 
instance, it is quite conceivable that the function ®(wo) /w; in Eq. (9.75) 
has a minimum at a reasonable fre- 
quency w; as shown in Fig. 9.6. Then 
it will be possible to reduce the out- 
put random fluctuation by making the 


2( wo) system operate effectively atw. This 


We Wo 
Fre. 9.6 Fre. 9.7 
can be accomplished by a simple proportional servo feedback, indicated 
in Fig. 9.7. Then instead of Eq. we have 
m od 
mM we +e% z+ ky = 2 ay 
or 


moe +o Yikt oy =a (9.76) 


The natural frequency of the system can then be made to be œf by 
requiring 


ee (9.77) 


> 


Therefore, by proper choice of a, the output can be reduced. 

9.10 Lift on a Two-dimensional Airfoil in an Incompressible Turbu- 
lent Flow. As a second example, consider a flat, thin airfoil of chord c 
moving with a constant velocity U through turbulent air. Let x lie 
along the chord, z along the span of the wing, and y normal to the span 
and cord. The components of the fluctuating turbulent velocity u, v, 
and w are assumed to be small in comparison to U. Because of these 
turbulent fluctuations, a time-dependent apparent angle of attack a exists 
at the airfoil, and, hence, fluctuating lift forces are produced. The fluctu- 
ating angle of attack a is given by 


q| = 


Q = 


as long as the fluctuations are small. a(t) now plays the role of the 
forcing function. The “response” is the fluctuating lift of the airfoil, or, 
better, the lift coefficient Ci(t). This is a problem studied by H. W. 
Liepmann.! 

1H. W. Liepmann, J. Aeronaut. Kci., 19, 793-801 (1952). 
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To find the mean square C?(t) of the lift coefficient, it is first necessary 
to define a transfer function for the airfoil. This has already been done 
in Sec. 3.7. In fact the frequency response F(iw), with v as the input 
and the lift coefficient C, as the output, is specified by Eqs. (3.54) to 
(3.58). 

Turbulent fluctuations are, however, essentially three-dimensional— 
that is, u, v, and w will be functions of x, y, 2, andt. For the first analysis, 
it seems sufficient to consider only the component v and its dependence 
upon zand?. Thus, in turbulent flow we consider a fluctuating velocity 
or angle of attack of the form 
v(x, t) 

U 


a(x,t) = 


If it is now assumed that the turbulence pattern does not change appreci- 
ably during a time of the order c/U, then the turbulent angle of attack 
will also depend upon t — (z/U) only, and Sear’s result, given in Sec. 
3.7, can be applied. This assumption is frequently made in turbulence 
analysis and requires essentially the condition that the rate of change of 
fluid velocity by following a fluid particle is small compared with the rate 
of change of the fluid velocity at a fixed position. With this assumption, 


CP = 4r? i ° (co) |o(k) [2 de (9.78) 


where (w) is the power spectrum of v/U. 
According to Eqs. (9.31) and (9.32), 


v L 1+8 (L%0*/0?) 
U? rU [1 + (Lw?/ U3] 
Furthermore, Liepmann has discovered that |y(k)|* can be approximated 
by 


blo) = (9.79) 


l 
2 ye ooo 
le)? = oR (9.80) 
Thus 
aT , 0? ” 1+ 8u? 1 
i= oa ah G+ wy Tt gu 
a | -r | 7+8 ‘ 
where 
TC 
=+ (9.82) 


This dependence of the mean-square lift coefficient upon the parameter 7 
is shown in Fig. 9.8. 
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Evidently, if c/L — 0, we have an airfoil of small chord in a large-scale 
turbulence, and 


The airfoil behaves in a quasi-stationary manner with a lift slope of 2r. 
If, on the other hand, c/L becomes large, we deal with an airfoil of long 
chord in a small-scale turbulence. It follows from Eq. (9.80) that C? — 0. 
That is to say the “gusts” cancel each other out completely, and the net 
lift is zero, as might be expected. 


1.0 


0 2 4 1 6 8 10 
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9.11 Intermittent Input. A phenomenon of major importance for 
aerodynamic buffeting is the so-called “intermittency” in wake flow. 
That is, the edge of any wake fluctuates with a large-scale motion so 
that a point situated near the edge will sometimes be within the wake 
and sometimes outside. For a tail surface situated near the edge of the 
wake of a stalled or partially stalled wing, this “intermittency” may 
thus be extremely important in determining the lift forces on the tail 
wing. For a crude idea of the effect, consider the flow at the tail as a 
region of uniform down-wash switched on and off at irregular intervals. 
Such a flow is probably a good model for the conditions in the wake of an 
intermittently stalling wing. If the probability of switching over from 
one region to the other is assumed to be governed by Poisson’s distribu- 
tion, one can then apply the power spectrum of Eq. (9.50) with some 
modifications. The mean deviation is not unity but the mean angle 
4//U; and the average time during which the forcing function is 
switched on is the mean interval 7’. Thus the power spectrum is 


ye 7 l 


Ho) = TF Ty Tp 


(9.83) 
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Then the mean-square lift coefficient is approximately 


a -Ble ee 
S Tie Jo f+ OTAI + x(wc/U) 
= a n logn + 
spy mw 
= ba! Te forn= Fp (9:84) 


This relation is plotted in Fig. 9.9. The limiting values at n = 0 and 
4 — © are, of course, the same for the case studied in the previous section. 
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9.12 Servo Design for Random Input. We have already indicated, in 
connection with the response of a second-order system to random input, 
the possibility of improving the behavior of the system by servo control. 
In that instance, however, the feedback mechanism is rather primitive 
in that the feedback control force required is of the same order of magni- 
tude as the input forcing function. In a more practical design, this feed- 
back mechanism can be made much more subtle, so as to reduce the 
control force required. For instance, a wing in a turbulent flow can be 
controlled by a feedback servo which moves the hinged flap. The force 
necessary to move the flap could be quite small in comparison with the 
aerodynamic effects such movement produces. The servo link can be 
thought of as that indicated in Fig. 9.10. The input random function 
is the turbulent air stream. The first aerodynamic transfer function 
F(s) is the relation between the turbulent air stream and the aerodynamic 
lift due to the turbulent air stream. This function is approximated by 
that of Eq. (3.56). As a result of the changing lift and moment, the 
wing is subject to vertical and torsional motion. The relation between 
the aerodynamic forces and the wing motion is given by the structural 
transfer function F(s). The wing motion will have two effects. There 
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are the aerodynamic forces due to wing motion through the second aero- 
dynamic transfer function F,(s). This is the first feedback loop. This 
loop, however, is not under designer’s control. The designer’s control 
is on the second loop. The wing motion can be used to generate flap 
motion through the transfer function F4(s). The flap motion will again 


Aerodynamic | 
F,(s) 


Structure 
F(s) 


Flap Servo 
F,(s) 


Aerodynamic 
Force 
F(s) 


(Random) 


Aerodynamic 
Fs(s}) 
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generate aerodynamic forces through the transfer function Fs(s). Thus 
the input and output relation is as follows: 


Y = F,.(s)[Fi(s)X + F3(s)¥ + Fs(s)Fa(s) Y] 
or 

Oe es Fy(s)Fo(s) 
TEO “TOR + HORE 


The over-all system transfer function F;(s) can thus be modified by 
changing the servo transfer function F4(s). 

If (w) is the power spectrum of the input z, the power spectrum g(w) 
of the output, or the wing displacement y(t), is then, according to Eq. 
(9.71) 


(9.85) 


g(w) = O(w)F,(tw)F’.(— tw) (9.86) 


where F’,(s) is given by Eq. (9.85). Now it is quite conceivable that in 
order to maximize passenger comfort in the airplane, we would wish to 
make the acceleration y” (t) of the wing as small as possible. This means 
we have to minimize y”?. Equation (9.16) shows that the mean square of 
y” can be calculated from the correlation function. But the correlation 
function can be calculated from the power spectrum by Eq. (9.22). By 


combining these equations with Eqs. (9.85) and (9.86), we have 


F'\(4w) F'2(tw) z 


or a aaa A 8 


y(t) = i Í w*B(w) 


The minimization is to be carried out by modifying the servo transfer 
function F,(s). To do so we can construct the servo transfer function 
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with unspecified parameters. Then with all other transfer functions 
F(s), Fe(s), F3(s), and F;(s) fixed and the input power spectrum given, 
say, as Eq. (9.79), y”? (t) can be computed as afunction of these unspeci~ 
fied parameters. The problem of minimization is then an ordinary 
minimum problem with respect to these parameters, and the condition of 
minimization determines the parameters. The resultant servo transfer 
function is then the best transfer function for the purpose of maximizing 
passenger comfort. 

The above discussion is but one example of optimum servo design 
for a specified purpose and specified input. As another instance, the 
design condition could be the minimization of the mean square of the 
elastic stress induced in the wing structure by the turbulent air stream. 
Then the system transfer function will be a different one, but the general 
formulation of the problem remains the same. Such a problem of quan- 
titative optimum design can be considered as one step beyond the mere 
requirement of stability and other qualitative criteria of servomechanisms 
introduced in the previous chapters. This general concept was perhaps 
first formulated by A. S. Boksenbom and D. Novik.! We shall touch upon 
this problem again in Chap. 16. 


1 A. S. Boksenbom, D. Novik, NACA TN 2939 (1953). 


CHAPTER 10 


RELAY SERVOMECHANISMS 


If there is an on-off relay in the servomechanism, the system is called 
a relay servomechanism. As pointed out in Sec. 6.3, the one great 
advantage of a relay servomechanism is its low cost. However, since 
the output of a relay is not proportional to the input, t.e., the input-output 
relation is not linear, the behavior of a relay servomechanism cannot be 
analyzed by a linear theory. In this chapter, we shall first present an 
approximate theory for investigating the stability of relay and other 
similar servomechanisms, based again on a modification of the Nyquist 
criterion. In the latter part of this chapter, the more advanced and 
more difficult problem of how to utilize the inherent nonlinear character- 
istics of a relay to achieve superior performance from the servomechanism 
will be discussed. Unfortunately, this particular subject is still far from 
being completely investigated; the general solution is yet to come. 

10.1 Approximate Frequency Response of a Relay. Let us consider 
a sinusoidal input z(t) of frequency w and amplitude a, 


x(t) = a sin wt (10.1) 


The characteristics of the relay will be idealized in that no time delay is 
considered, and the action is considered to be instantaneous, t.e., there 
is no time lag. On the other hand, the hysteresis of the relay action is 
included: When the input is positive and increasing, the relay output 
changes from zero to the full value of A at z = b. When the input is 
positive but decreasing, the relay output changes from A to zero atx = c. 
b is greater than c. b is called the pull-in current and c the drop-out 
current. Pull-in and drop-out for negative input occur at z = —b and 
x = —c, respectively. The input-output relation can then be presented 
as that of Fig. 10.1. It is evident that there is a phase shift between 
the input and the output. This phase lag @ is 


-Ilona gnat 
o= 3] sin 7 Sin 4 (10.2) 


The output square waves have a time duration of 2a/w, where a is 
given by 


a= 


. b l . b “ C 
+ 0 sin. 5 E sin sin l (10.3) 
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The period of the output is the same as that of the input and is equal to 
27 /w. 
Now the output y(t) can be analyzed into a Fourier series, 


(za) 


y(t) = Y an sin [n(wt — 9)] (10.4) 


1 


The coefficient a; of the first harmonic is simply 
4A . 
Qi = —— SIN Q&Q 
T 


where a@ is given by Eq. (10.3). In a relay servomechanism, Fig. 10.2, 
the relay output is the correction signal which actuates the servo. The 


Controlled 
System 


Fie. 10.2 


servo has generally the property of a filter and greatly reduces the 
importance of higher harmonics. Therefore, as an approximation, we 
may neglect all.the higher harmonics and consider the output to be 
a, sin (wt — 6). Then the ratio of output to input is 
4A sina _. 

F(t) = n ? (10.5) 
F,(iw) can then be considered the frequency response òf the relay. Of 
course it is not the true frequency response. True frequency response, 
as defined in the previous chapters, is a function of frequency only, not 
a function of the input amplitude. The frequency response of Eq. 
(10.5) is, however, a function of the input amplitude a. On the other 
hand, F,(iw) is not a function of the frequency w. Therefore the func- 
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tional notation adopted is not appropriate; but for easy identification we 
shall keep the notation in spite of this. 

When the amplitude a of the input is very large, we have, from Eqs. 
(10.2), (10.3), and (10.5), 


F(tw) = 


- for a —> © (10.6) 


aire 


When the amplitude a is very very small, the relay will not give any 
response. The cutoff point occurs ata =b. Then 


ee eee ee: = 
peaad(’ sin z) fora =b (10.7) 


These limiting values of the relay frequency response are thus completely 
determined by the relay characteristics. 

10.2 Method of Kochenburger. Let us assume for the moment that 
the amplitudes a of the harmonic components of the input to the relay 
are all equal. Then the frequency response F,(iw) of the relay is a com- 
plex constant, given by Eq. (10.5). If Fı(iw) is the frequency response 
of the rest of the circuit in Fig. 10.2, then the over-all frequency response 
of the forward circuit is F,(iw)Fi(iw). If we apply the Nyquist criterion 
of Sec. 4.3, then for stability the curve traced on the complex plane by 
1/[F, (ie) F1(éw)] as w varies from 0 to © must “enclose” the point —1. 
In other words the curve 1/[F,(iw)Fi(tw)] must cross the real axis to the 
left of the point —1. But for constant input amplitude a to the relay, 
F,(ia) is a constant, and the above-stated condition for stability is 
equivalent to requiring the frequency-response curve | /F (iw) to enclose 
the point —F,(tw), as w varies from 0 to ©. This is the basis of the 
frequency-response method of Kochenburger’ for determining the stabil- 
ity of a relay servomechanism. Dutilh? developed a similar method 
independently. 

Kochenburger argues that when the amplitudes a of the harmonic 
components of the input to the relay are not equal, all that 1s necessary 
is to apply the stability condition deduced in the previous paragraph to 
all values of F,(iw) as the amplitude a varies from 0 to œ. The locus 
of —F,(iw) for various a is a curve, starting at the cutoff point specified 
by Eq. (10.7) and ending at the origin of the complex plane. This is 
shown in Fig. 10.3. Kochenburger’s sufficient condition for stability 
is then: The curve 1/F,(iw) must enclose the complete locus of —F, (tw), 
as shown in Fig. 10.3. Figure 10.4 shows the case of complete instability. 
The arrows on the —F,(iw) curve show the direction of increasing ampli- 
tude of the input to the relay. The arrows on the 1/Fi(iw) curve show 
the direction of increasing frequency, starting at the origin with w = 0. 


1 R. J. Kochenburger, Trans. AIEE, 69, 270-284 (1950). 
2 J. R. Dutilh, L’ Onde électrique, 30, 488-445 (1950). 
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Besides these cases of complete stability and complete instability, there 
are various cases of partial stability or instability, with the possibility 
of persistent oscillations of a certain frequency at constant amplitude. 
For instance, Fig. 10.5 shows a case having a convergent point. For 
small amplitudes, the —F.(iw) points are “outside” the 1//'1(¢w) curve, and 
thus the system is unstable. Then the amplitude of the oscillations will 


Fre. 10.3 


Fre. 10.5 Fre. 10.6 


increase. As the amplitude increases, the point —F,(i#) moves towards 
the curve 1/F (iw), and finally the point P, is reached. Then the system 
will oscillate with a frequency and an amplitude corresponding to that 
point. The oscillation is thus self-starting, and the behavior is called 
“soft” self-excitation. There is no tendency to move away from Py, 
because any increase in amplitude will meet damping effects by entering 
the stable region of the diagram. Pis thus a convergent point, and the 
system will always oscillate. Figure 10.6 shows a different case where 
the point P2 of the intersection of the —F,(tw) curve and 1/Fı(iw) curve 
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is a divergent point. The system always tends to move away from that 
point. It is, however, stable for small disturbances. 

Figures 10.7 and 10.8 show yet more complicated cases, having both 
convergent points Pı and divergent points Pa. The system of Fig. 10.7 
will oscillate with disturbances of sufficiently large amplitude. This 
behavior is called “hard” self-excitation. The system of Fig. 10.8 will 
always oscillate unless the amplitude of the disturbance is too large. For 
very large disturbances, the system will diverge. All the cases depicted 
in Figs. 10.5 to 10.8 show the peculiar dependence of the behavior of the 
system on the amplitude of the disturbances, and the possibility of 
persistent oscillation at fixed frequency and amplitude. These are all 


Fre. 10.7 Fra. 10.8 


characteristics of nonlinear systems and are not present in the linear 
systems studied in the previous chapters. Such behaviors are of course 
to be expected from our introductory discussions of nonlinear systems in 
Sec. 1.3. 

10.3 Other Frequency-insensitive Nonlinear Devices. The Koch- 
enburger method is a very effective solution for the problem of stability 
of a relay servomechanism. It can be applied to systems with a con- 
siderable degree of complexity; and it allows the direct inclusion of 
experimentally measured frequency-response data. For most applica- 
tions where the servo after the relay gives sufficient filtering action, the 
neglection of higher harmonics in the relay output is fully justified. The 
prediction of theory is found to be in full agreement with experimental 
observation. Therefore, if the only design criterion is stability, then 
the Kochenburger method solves the problem completely for relay 
servomechanisms. 

In fact, the Kochenburger method is not only applicable to relay servo- 
mechanisms, but can be applied equally well to many other nonlinear 
devices. The essential point of this method of analysis is the discovery 
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that the behavior of the relay is frequency invariant but amplitude 
variant, while the behavior of a linear system is frequency variant but 
amplitude invariant. Now there are many nonlinear devices which have 
the same behavior as the relay. For instance, a gear train with backlash 
is such a device. We can see this in the following way: Let 6, be the 
angular position of the shaft of the driving motor, which is rigidly con- 
nected to the first gear of the train, 
and let 6, be the angular position 
of the shaft of the last gear of the 
train. Then the relation between 0, 
and z can be represented as in Fig. 
10.9, where 26 is the total backlash 
of the train. If 6,, the input to the 
gear train, is a sinusoidal oscillation, 
fə, the output, is a sort of clipped 
sinusoidal wave with a lag in phase 
(Fig. 10.10). It is easy to see that, 
since the relation between 6; and 62 
is not explicitly dependent upon Fic. 10.9 

time, the wave form of 62 will not be 

modified by a change in frequency of 6;. Thus the response of the gear 
train is amplitude variant but frequency invariant. If we denote the 
ratio of the amplitude of the fundamental of 0 to that of the input 6; 
and the phase lag by the response F, (iw), then F,(iw) is a function of a, 
but not of w. This response function can then be used to study servo- 
mechanisms containing such gear trains with backlash in exactly the same 
manner as the relay response function F,(iw) in the preceding section. 


Fie. 10.10 


10.4 Optimum Performance of a Relay Servomechanism. It is to 
be borne in mind that, according to the Kochenburger diagram for stabil- 
ity, the curve 1/Fi(zw) has to avoid the whole locus of the relay response 
—F,(iw), not just the single point —1 as in the case of a conventional 
servomechanism. This is a more stringent condition on other parts of 
the system, and is also the reason why the performance of a relay servo- 
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mechanism tends to be inferior to the performance of a conventional servo- 
mechanism. However, this is not an intrinsic shortcoming of the relay 
servomechanism. In fact, instead of merely asking for stability, we can 
consider the relay as a switching device, capable of producing a positive 
constant correction signal, a negative constant correction signal, or no 
correction signal, and then ask the question: How should we switch the 
relay in accordance with the output so as to obtain the optimum perform- 
ance of the over-all system? For instance, the requirement on perform- 
ance may be the quickest possible return to the normal state after a 
disturbance. This requirement not only guarantees the return to normal 
state (stability) but also specifies the speediest return. The solution to 
this problem of optimum performance is to specify the proper switching 
action of the relay as a function of the output, and this switching function 
is the basis for the actual design of the servo system. A relay servo- 
mechanism designed with this more general point of view will certainly 
have a performance superior to that of a conventional linear servo- 
mechanism, because the nonlinear characteristics of the relay are utilized 
to the fullest extent. 

10.5 Phase Plane. If y is the output and x the input, the differential 
equation of a general second-order system, linear or nonlinear, can be 
written as 


fug) = x) (10.8) 


where the dot denotes differentiation with respect to time. We can 
rewrite Eq. (10.8) as the system 


(10.9) 


If we consider y and ý as the dependent variables, Eq. (10.9) is a system 
of two simultaneous first-order differential equations for the unknowns 
y and y. If the input is absent, x = 0, and if f is not a function of t, 
that 1s, the system is autonomous, as is frequently the case, then the first 
equation of Eq. (10.9) can be solved for dy/dt and gives dy/dt as a func- 
tion of y and y. Then the system can be written as 


3 7 yk(y,y) 
7 (10.10) 
a 


This system of equations does not contain t explicitly. By dividing the 
first equation of Eq. (10.10) by the second, we have 
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E = hs) (10.11) 
This is now a first-order equation, with y as the independent variable 
and y as the dependent variable. After this equation is solved, Eq. 
(10.10) can be used to calculate the relation between ¢ and y. 

Physically, the procedure outlined in the previous paragraph is based 
upon the concept of characterizing the state of the system by y and ġ, 
instead of the more conventional method of specifying it by y andt. If 
y is the variable describing the position of a point mass, then y is the 
“velocity.” can thus be considered as representative of the momentum 
of the point mass. y and y then rep- 
resent the position and the momen- 
tum, respectively, of the point mass. 
Physicists call such a representation 
of state the representation in phase 
space. In the particular case dis- 
cussed, the phase space has only two 
dimensions; it is thus a phase plane. 
The behavior of the second-order 
system is then specified by a curve in 
the phase plane. Every point on the 
curve represents the state of the system at a certain time ¿. Itis custom- 
ary to indicate the sense of increasing time along the curve by an arrow, 
as Fig. 10.11. If the order n of the system is higher than 2, the phase 
space will be of n dimensions, and the behavior of the system is represented 
by a curve in this n-dimensional space. 

The practical advantage of phase-plane representation is that a very 
large number of nonlinear systems of second order are autonomous 
systems and can be expressed as Eq. (10.11), and this equation can be 
solved, at least graphically, by the isocline method. In fact, the char- 
acter of the system is at once clear when the field of directions of the 
curves, as specified by Eq. (10.11), is indicated in the phase plane. The 
use of such geometrical properties of the phase plane is at the heart 
of the theory of nonlinear oscillations and is called the topological meth- 
ods of nonlinear mechanics. 

To translate our previous concepts to the language of the phase plane, 
let us consider the simple problem of a linear second-order system with- 
out a forcing function, 


Fre. 10.11 


dy dy _ 
apt oa ty =0 (10.12) 


which is obtained from Eq. (8.39) by choosing the time unit in such a 
way as to make the natural frequency w unity. ¢ is of course the ratio 
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of the damping of the system to critical damping. For oscillations, 
I| < 1. Equation (10.12) can be rewritten as 

dy bce | 

= = —2y — y | 

(10.13) 

dY 

TY 
Therefore, corresponding to Eq. (10.11), we have 


dy ary + y y 
OO nes ea 10.14 
7 j t i ( ) 


It is clear then that the lines of constant slope dy/dy are radial lines from 
the origin of the phase plane. Figures 10.12 to 10.16 are the five cases 


os 


Frc. 10.12 Fre. 10.13 


Fre. 10.14 Fie. 10.15 
with ¢ < —1, -l1<¢ <0, ¢=0, 0<¢ <1, and, finally, 1 <¢. 
Figures 10.12 and 10.16 are the nonoscillatory cases, Figs. 10.13 and 
10.15 are the oscillatory cases, and Fig. 10.14 is the case of purely har- 


monic oscillation. 
In the above figures, the origin of the phase plane corresponds to the 
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equilibrium state, where both dy/dt and dy/dt vanish. Mathematically, 
the origin is the singular point of the system of equations of Eq. (10.13). 
The character of the equilibrium state is, however, quite different for the 
three different cases ¢ < 0, ¢ = 0, and 0 < ¢. Figures 10.12 and 10.13 
show that when ¢ < 0, the lines of behavior of the system all diverge 
from the equilibrium state. Therefore the origin is an unstable equi- 
librium point. Figures 10.15 and 10.16 
show that when 0 < ¢, the lines of 
behavior all converge to the equilibrium 
state. Then the origin is a stable equilib- 1<t 
rium point. Mathematically, the origin 

in Figs. 10.12 and 10.16 isa point through N 
which all lines of behavior pass and is N 
called the node. The origin in Figs. 10.18 q 


Nts 


and 10.15 is the center of the spirals 

and is called the focus. In the special 

case of Fig. 10.14, where ¢ = 0, the lines 

of behavior circle the origin. The origin 

is then called the center. Fie. 10.16 
If the basic equation of the second- 

order system has a constant hei term, t.e., 


a ae a e+ y=C (10.15) 


where c is a constant, then 
Dap ae 
Py 29) 4 op AVA 9 5 ty — 6) =0 
Therefore the lines of behavior in n phase plane are entirely similar to 
those indicated in Figs. 10.12 to 10.16, with only the modification of 
translating the equilibrium point to y = c, on the y axis. 

10.6 Linear Switching. In the subsequent discussion, we shall sim- 
plify the problem of switching the relay by assuming only two states 
for the relay: unit positive output and unit negative output. The speci- 
fication of unit output is evidently not a restriction to our problem. But 
before attacking the optimum switching problem proper, let us consider 
the simpler case of linear switching, i.e., the case where the forcing func- 
tion c generated by the relay is equal to unity but has the same sign as 
ay + by. The purpose here is to demonstrate some of the characteristics 
of the problem. 

The performance of a linearly switched relay servomechanism is 
analyzed by Fliigge-Lotz,! and Fliigge-Lotz and Klotter.? The following 


1 I. Fligge-Lotz, Z. angew. Math. Mech., 25-27, 97-113 (1947). 
2 I. Fligge-Lotz and K. Klotter, Z. angew. Math. Mech., 28, 317-337 (1948). 
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discussion is a summary of the qualitative results of their investigation. 
The differential equation of the system studied by the above authors is 


Th tog Vt y = sen (ay+by) for0<¢<1 (10.16) 
A curve in the phase plane with unit positive forcing function is called 
a Parc. <A curve with unit negative forcing function is called an N are. 
The system of all P arcs is called the P system, and the system of all N 
ares is called the N system. It is clear from our discussion in connection 
with Eq. (10.15) that for the particular equation under study, the P 
system is a system of converging 
spirals with its focus at the point 
y= +1,y7 =0; and that the N 
system is a system of converging 
spirals with its focus at the point 
y= —1, y=0. The desired end 
state of the system is, of course, the 
origin y = y = 0. 

According to the signs of a and 
b in the switching function of Eq. 
(10.16), four cases can be defined. 
Let us call Case 1 the case with 
a>b,b>0. Then the switching 
line ay + by = 0 is a straight line 
passing through the origin of the 
phase plane and liesin the second and the fourth quadrants. To the right 
of it, the sign of ay + by is positive, and the region is that of a P system. 
To the left of it, the sign of ay + by is negative, and the region is that of an 
N system. At the switching line, the two systems join, and there the 
corners of the curves of behavior occur (Fig. 10.17). The condition for 
the existence of a periodic solution is that there should exist a P arc whose 
intersections with the switching line are equidistant from the origin; for 
then, by symmetry, there exists an N arc on the other side of the switching 
line joining the same two points, and these two arcs together form a closed 
curve in the phase plane. Such periodic solutions are called limit cycles in 
the terminology of nonlinear mechanics. We shall see presently that a 
periodic solution can occur in our case. 

Let Sp and Sy be the points on the switching line where a P and an N 
curve, respectively, are tangent; let Rp and Ry be the last intersections 
preceding Sp and Sy with the switching line of the P and N curves 
through these points. Sp and Sy are symmetrical with respect to the 
origin, as are Hp and Ry. Suppose that ¢, a, and b are such that Ry is 
outside the segment SpSy, as shown in Fig. 10.18. A solution starting 


Switching Line 
ay+by=0 


Fia. 10.17 
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sufficiently near the segment SpSy will move away from the line in one 
direction or the other, according to the side of the line on which its 
initial point les, and never return to the switching line at all. It can 
also be shown that every P arc which lies to the right of the switching 
line and has its end on this line also has the property that its terminal 
point is nearer the origin than its initial point; hence the condition for a 
periodic solution can never be satished. The lines of behavior of the 
system then always spiral to one of the foci, and the end state is not the 
origin of the phase plane. 

If, however, Ry and Rp are on the segment SpSy (Fig. 10.19), then 
there exists a periodic solution. The reason is as follows. The P arc 


Closed Cycle 


Fie. 10.18 Frc. 10.19 


ReSp starts at a point which is nearer to the origin than its end point, 
according to our assumption. However, at large distances from the 
origin, where the effects of shifting the foci of the converging spirals 
from the origin to +1 and —1 for the P system and the N system, respec- 
tively, are negligible, a P arc must start on the switching line at a point 
farther away from the origin than its end point on the switching line. 
These arcs then have the reverse property of RpSp. Therefore, by 
continuity, some intermediate P arc of this type must begin and end 
at the same distance from the origin. This is the condition for the 
existence of a periodic solution, which is shown in Fig. 10.19 as a closed 
cycle. Extended analysis bears this out and shows that the periodic 
solution is unique and, more important, orbitally stable: All solutions 
beginning outside the periodic solution spiral onto it, and those solutions 
which begin inside the periodic solution but outside the area enclosed by 
RpSpRySy, the shaded area in Fig. 10.19, also spiral onto it. Lines of 
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behavior beginning within the shaded area will spiral to one of the foci. 
Here again we have no possibility of reaching the origin. 

Case 2 is the case where a > 0, but b <0. Now the switching line 
ay + by = 0 passes through the first and third quadrants. The P sys- 
tem and the N system are the same as in the previous case. No periodic 
solution occurs in this case. Let Re, Ry, Sp, and Sw be defined as before; 
then the points Rr, Sp, 0, Sy, and Ry lie on the switching line in this 
order, as shown in Fig. 10.20. But on the interval SpSy a new phenome- 
non occurs. Consider any solution which reaches this interval, say at 
the point Æ. What does the solution do at this point? It should pro- 
ceed along an N arc, because switching occurs on this line. However, the 
N arc from E goes back into the same half plane from which the solution 


Fie. 10.20 


entered E, and on this side a solution can contain only P arcs; on the 
other hand, the solution certainly cannot follow the P are through E 
beyond this point. We may say then that the solution is not defined 
beyond E, it “ends” at Æ. Any solution starting outside the region 
RpRy spirals in toward the origin until it reaches the interval RvA». It 
it reaches the interval RpSp, it will spiral to +1. If it reaches RySy it 
will spiral to —1. If it reaches SpSy, we have the curious phenomenon 
of “ending” the solution. 

In reality, the behavior of the system cannot “end” but must go on. 
The paradox is resolved by the observation that switching action always 
has a time lag, and when a solution meets the switching line, it actually 
proceeds for some distance beyond it before it has the change of sign 
of the forcing function. In Case 1 such a time lag, provided that it is 
not too large, does not affect the essential behavior of the system. But 
in the present case, the time lag avoids the necessary “end” of the solu- 
tion. Consider a solution entering an end point. Because of the time 
lag, it no longer ends there, but proceeds for a certain distance beyond; 
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then switching occurs, and the solution in the phase plane makes a 
“corner,” where the solution is still defined. From this corner, it crosses 
the switching line in the reverse direction, moves for a short distance 
beyond, has another corner, and so on, as shown in Fig. 10.21. From 
that figure, it is also seen that such zigzag action of the system results 
in its creeping out of the region SpSy, and eventually the solution will 


Fre. 10.21 


spiral into one of the foci. Time lag will thus avoid the ending of the 
solution, but still the behavior of the system is unsatisfactory because 
the solution is not able to reach the origin. 

Case 3 is the case where a < 0, b > 0. In this case the switching 
line lies again in the first and the third quadrants of the phase plane. 
But the P system now lies to the left of the switching line, and the N 
system now lies to the right of the switch- . 
ing line. In this case, a stable periodic 
solution, or a stable limit cycle (Fig. 
10.22), always exists, and it dominates 
the whole situation, for all other solutions 
spiral into it. Here again the origin of 
the phase plane cannot be reached. 

Case 4 is the case where both a and b 
are negative. The switching line is the 
same as in Case 1, but the arcs are the 
same asin Case3. It may beshown that 
no periodic solution can exist in this 
case. Without time lag in the switching 
action, the segment SpSy (Fig. 10.23) consists of end points, and by tracing 
the solutions which end on it backwards one can see that these cover the 
entire plane; thus in this case all solutions “end” on the segment Sp8y of 
the switching line. 

But here again, as in Case 2, the presence of time lag makes a differ- 
ence. The time lag makes no difference of importance until the solution 
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in question reaches SpSy; then, instead of ending, the solution proceeds 
for a small distance beyond the switching line, has a corner, crosses the 
switching line again, has another corner, and so on. It may be seen 
from Fig. 10.23 that this motion forces the system to the origin. The 
system will finally oscillate at high frequency and small amplitude 
around the origin—the smaller the time lag, the higher the frequency. 
This is what is called “chattering” of the servo system. 

It is thus seen that out of the four cases discussed, only Case 4 gives 
a system seeking the desired equilibrium state. But even then, the 


ay+by=0 
Fic. 10.23 


system will chatter near the equilibrium state. Our discussion of the 
analysis of Fliigge-Lotz and Klotter then demonstrates the shortcoming: 
of linear switching. Optimum switching for best performance of the 
servomechanism is definitely not linear switching. In the following 
sections, we shall show this. 

10.7 Optimum Switching Function. An autonomous second-order 
system with forcing function of unit magnitude can be written as 


dy _ 
a 
ay (10.17) 
g T Id) = eud) 


where ø(y,ġ) is a discontinuous function with only the two possible 
values +1 or —1. Then the optimum switching problem can be defined 
as follows: find the function y(y,y) such that, beginning at any point y 
of the phase plane, the solution will pass through the origin 0, and the 
length of time necessary to move from p to 0 along the solution from 7 
is minimal with respect to e—no other ¢ could make this time shorter. 
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Then ¢(y,9) is the optimum switching function. This particular switching 
problem was studied by Bushaw,! and the special case of linear g(y,¥), 
that is, g(y,y) = 2ty + y, was completely solved by him for all real 
values of ¢. The mathematical arguments which led to Bushaw’s results 
are, however, complicated and are difficult to extend to other cases. We 
shall thus limit ourselves to indicating his solution. 

A general result, good for any continuous g(y,y), is Bushaw’s canonical 

path. A path is a line of behavior in the phase plane. Since o(y,y) can 
take only the value +1 or —1, a path consists of P ares and N ares. A 
junction of these arcs is called a PN 
corner if with increasing time the 
switching is from +1 to —1 for the 
forcing function. Similarly, a junc- 
tion is called an NP corner if the 
switching is from —1 to +1. A 
path is called canonical if it contains 
no NP corners above the y axis and 
no PN corners below theyaxis. The 
importance of the canonical path is 
that a minimal path, t.e., a path of Fie. 10.24 
minimum time, must be canonical. 
That is, given any path A from a point p which is not canonical, one can 
find a canonical path from p which is shorter than A in terms of time. 
This can be shown quite easily. Given, say, a path with the NP corner 
p above the y axis, as shown in Fig. 10.24, one denotes by p’ either the 
last corner of the path preceding p or the last intersection preceding p 
of the path with the y axis, whichever is nearer p, and one denotes by p” 
the corresponding point following p. We then draw the P curve forward 
from y’ and the N curve backward from p”. Now, from the basic equa- 
tions of Eq. (10.17), we have 


dy _ —9y,y) + eù) (10.18) 
dy y 


Therefore, at any point in the phase plane, the slope of the P curve is 
always greater, algebraically, than the slope of the N curve. Thus the 
configuration of the paths must be like that indicated in Fig. 10.24. If 
we now modify the given path by replacing p’pp” by p’p’’’p”, the NP 
corner p is removed, and the path is made canonical. If we denote by 
t(p’pp'’) the time interval in going from p’, through p to p”; and by 
t(p'p’"'p’), the time interval along the canonical path, then 


1D. W. Bushaw, Experimental Towing Tank, Stevens Institute of Technology 
Report 469, Hoboken, N. J. 1953. 
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But at any y, the value of ý on the canonical path is greater than the 
value of y on the original path, and therefore t(p'p”'p”) < t(p'pp”). 
Thus the canonical path is “shorter” than the noncanonical path. 

As a simple example of the application of the theory of the optimum 
switching function, let us take g(y,y) = ¢y. Then the system of Eq. 
(10.17) becomes 


nicer: sal y 
n (10.19) 


| 
l 


The P system and the N system of arcs depend, of course, on the value 
of ¢; but since Eq. (10.19) does not explicitly contain y, these systems of 


t=0 
Fie. 10.25 


ares consist of parallel curves shifted along the y axis. A typical P are 
and an N are with one branch passing through the origin, for each of the 
three possible values of £, are shown in Fig. 10.25. The case of ¢ < 0 
is different from the other cases, in that in order to reach the origin the 
initial value of y must be within the range —1/¢ to +1/¢. For this 
case then, the problem of optimum switching has meaning only if the 
initial y is within this specified range. 

We shall denote by T that part of the P curve through the origin which 
lies below the y axis, and by I~ its reflection about the origin. T- is 
therefore that part of the N curve through the origin which lies above 
the y axis. T and T- together gives a curve C. Bushaw showed that 
C is the optimum switching line in the sense that above C the optimum 
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switching function o(y,v) should be — 1, and below C the optimum switch- 
ing function ¢({y,y) should be +1. This is indicated in Fig. 10.26. 
Physically, the switching is done as follows. From any point p above C, 
the forcing function should be —1, and the system follows the N arc 
to the switching line C. There the forcing function changes to +1, and 
the system follows C to the origin. If the initial point p is below C, the 
forcing function should be +1, and the system follows the P arc to the 


Fig. 10.26 


Fig. 10.27 


switching line C. There the forcing function changes to —1 and the 
system follows C to the origin. 

That Bushaw’s solution for the optimum switching line is correct 
can be seen as follows. First of all, we note that in order to reach the 
origin, we must use C for the last part of the path, because C is the only 
curve through the origin. Now suppose that the initial point is above C 
and that the optimum path, according to Bushaw, is that indicated in 
Fig. 10.27(a) by the single N arc from p to C and then the path along C 
to the origin. Then if the switching is done too early, we have an NP 
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corner before reaching C. In order to reach C we have to switch again 
and make a PN corner. If this switching is done at p’ when 7 is still 
negative, then we have a PN corner below the y axis; this violates the 
rule for a canonical path. The time along the modified path is definitely 
longer than the optimum path. If the PN corner is made at p” when y 
is positive, the time required will be even larger, because the path then 
has a closed loop. Thus switching too early is disadvantageous. Figure 
10.27(b) shows the case of switching too late. Since the paths p’0 and 
pp” are equivalent and thus require the same time interval, it is easily 
seen from the figure that switching too late is also detrimental. Figure 
10.27(c) shows yet another variation, where the first part of the path is 
a P arc instead of an N arc. But itis apparent from the figure that this 
variation is also worse than the opti- 
mum path. These considerations indi- 
cate the correctness of choosing the 
canonical path as the optimum switch- 
ing line. 

10.8 Optimum Switching Line for 
Linear Second-order Systems. 
Bushaw has determined the optimum 
switching line for the linear second- 
order systems with g(y,y) = 2¢y + y, 
for all real values of ¢. We shall only 
state his result here without proof; but 

Fre. 10.28 in view of our discussion of the simple 
case in the preceding section, the general character of the result can be 
easily understood. The P system and the N system for this g(y,7) are 
simply the family of curves in Figs. 10.12 to 10.16 with the origin shifted 
to (+1,0) and (—1,0), respectively. 

The case nearest to our simple example is the case of ¢ > 1. Then 
the switching line C consists of the P arc from infinity to the origin of the 
phase plane, and the N arc from infinity to the origin. Again, above C, 
the switching function ¢ takes the value —1; and below C, ¢ is equal to 
+1. The optimum path from an initial point above C is thus as indi- 
cated in Fig. 10.28. 

When ¢ < —1, then, as in the simple example, only from points within 
a limited region of the phase plane can the system reach the origin; 
because without the forcing function the system is unstable. Bushaw 
specifies the boundaries of this region as the P arc from the point (+1,0) 
to the point (—1,0) and the N arc from the point (—1,0) to the point 
(+1,0), as shown in Fig. 10.29. The switching problem has meaning 
only for initial points within this region. The optimum switching line C 
consists of the P arc to the origin and the N arc to the origin. Above C, 
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the switching function ¢ is equal to —1, and below C, gis +1. The 
optimum path from an initial point p above C is shown in Fig. 10.29. 

When ¢ = 0, the P system and the N system are circles with the 
centers (+1,0) and (—1,0), respectively. The optimum switching line C 
(Fig. 10.30) is a series of semicircular 
ares of radius unity, starting at the 
origin, and stretching along the y axis 
in both directions. The arcs are 
below the y axis for positive y and 
are above the y axis for negative y. 
Above the switching line C, the switch- 
ing function ø is equal to — 1; below 
it, ¢ is +1. Thus, starting from a 
point p as shown in Fig. 10.30, the path 
follows an N are, or a circular are with 
its center on (—1,0). When the path 
intersects C at a, the path becomes a P 
arc, or a circular arc with its center on (+1,0), until the path intersects C 
againatb. Atb,the path changes again into an N are, until the next inter- 
section with the switching line C, ete. The last intersection with C is d, 
and, from there, the path follows Cinto the origin. Thisis a considerably 
more complex switching performance than that for ¢ > 1. 


Fic. 10.29 


Frc. 10.30 


The next more difficult case is the case of converging spirals, or 0 < ¢ 
<1. For this case, Bushaw showed that the optimum switching line 
should be constructed as follows: We first draw a P spiral backward in 
time, starting at the origin. The first arc of C is the first arc of P, from 
the origin to the first intersection with the y axis. Then we draw the 
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reflection of all arcs above the y axis about their right point of intersection 
with the y axis. Then we assemble all these arcs below the y axis into a 
continuous curve by moving the consecutive ares parallel to the y axis 
until they join end to end, starting from the origin and extending to 
the right. This is the positive half of the optimum switching line. The 
negative half of the switching line is then obtained by reflection about 
the origin. Again, above the switching line C (Fig. 10.31), the switching 
function ø is —1; below it, gis +1. The situation is then very much 
the same as for the case ¢ = 0 shown in Fig. 10.30; the only difference 
is the replacement of circular arcs by spiral arcs. 


Fre. 10.31 


The last case is the case of diverging spirals, or -1<¢<0. The 
method of construction of the optimum switching line is exactly: the 
same as in the preceding case, except that now the consecutive spiral arcs 
diminish in size instead of increasing. The length of the switching line 
is thus finite, spanning the y axis from (—a,0) to (+a,0), as shown in 
Fig. 10.32. This is as it should be; because here we have negative 
damping, and, as in the case of Fig. 10.29, the path can reach the origin 
only if the initial point is within a certain region near the origin. In 
fact, the boundary of this region consists of the P arc from the point 
(a,0) to (—a,0) and the N arc from the point (—a,0) to (a,0). The 
optimum switching function ¢ takes the value —1 for points above C in 
the phase plane, and the value +1 for points below the switching line. 

The boundary curves of the regions of possible optimum switching 
as indicated in Figs. 10.26 and 10.29 are evidently the limit cycles with 
switching points at y = 0. They each then represent periodic solutions 
of the relay servomechanism considered. It is, however, equally appar- 
ent that these periodic solutions are unstable: the slightest disturbance 
will cause the lines of behavior of the system to spiral to the origin or to 
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diverge to infinity. Therefore such periodic solutions cannot occur in 
reality. 

One property of our solution for the optimum switching problem stands 
out: for all cases, the optimum switching function ø takes the value — 1 
in the first quadrant of the phase plane and takes the value +1 in the 


7 
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third quadrant. By writing Eq. (10.19) in the following form, 


we can understand this feature of the solution quite easily. The purpose 
of the design is to return to the state y = 0, or the ¢ axis, in the shortest 
possible time. When y and dy/dt are both positive, this purpose can be 
accomplished by making d?y/di?, or the curvature of the y(t) curve, as 
negatively large as possible, that is, e should be —1. When y and dy/di 
are both negative, d?y/dt? should be as positively large as possible, that 
is, ¢ should be +1. This intuitive reasoning agrees with our result for 
the optimum switching function. When y and y have different signs, 
the optimum value of ¢ cannot be so simply determined, because then 
the rate of return to the t axis depends upon the complicated interaction 
between y and y. Bushaw’s contribution is then to specify the optimum 
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g just in these regions, the second and the fourth quadrants of the phase 
plane. But it is clear from this discussion that the optimum switching 
line C must lie in the second and the fourth quadrants. 

For systems of higher order and for systems of more than one degree 
of freedom, the phase-plane representation of the state of the system is 
no longer possible. We have to use a phase space of many dimensions. 
By analogy with the problem already discussed, we may expect the 
solution of optimum switching of such complicated systems to be the 
determination of optimum switching surfaces in the phase space. How- 
ever, such problems have not yet been solved; only an initial attempt 
has been made by Kang and Fett.! 

10.9 Multiple-mode Operation. What happens when the system is 
guided by our switching action to the origin of the phase plane? Clearly, 


| Output 
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if the forcing function is continued at the value obtaining just before 
the origin is reached, the system will move away from the desired state 
again. Butas soon asit moves away from the origin, the switching action 
designed into the system will function and drive the system back towards 
the origin. The net result will be a rapid approach to the origin from 
any disturbed position, and then a high-frequency oscillation or chatter- 
ing around the origin. 

If small deviations from the rest state or the origin are not objection- 
able, the chattering near the origin can be eliminated by cutting off the 
forcing function whenever the system is near the rest state, 7.¢., whenever 
y and y are small enough to be negligible. The system then has two 
modes of operation: for large deviations, the switching action together 
with the output of the relay functions; for small deviations, such input 
to the system is cut off. We shall see the necessity of such multiple- 
mode operation of servomechanisms again in the later chapters. 

The optimum switching line, being nonlinear, cannot be implemented 
by a simple linear circuit. In fact, the measured value of the output y(t) 


! C. L. Kang and G. H. Fett, J. Appl. Phys., 24, 38-41 (1953). 
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has to be “digested” by a nonlinear device, or a computer. The com- 
puter is designed to generate the switching signal for the relay according 
to the optimum switching line. Furthermore, there should be another 
cutof computer to disconnect the relay output from the system when the 
output y and y are small enough. Therefore the block diagram of such 
a relay servomechanism is that of Fig. 10.33. The incorporation of 
computers into control systems will be generally necessary for the more 
complicated systems discussed in the following chapters. However, 
conceptually, this really involves nothing new—the compensating circuit 
used in a conventional servomechanism for modifying the system transfer 
function is also a computer. But in these simpler systems, the computing 
function can be performed by a linear circuit, such as an RC circuit. We 
shall give a more extensive discussion on computers in Chap. 13. 


CHAPTER 11 


NONLINEAR SYSTEMS 


A nonlinear system is a system for which the output is not linearly 
proportional to the input. The relay servomechanism is a simple exam- 
ple of such nonlinear systems. In Chap. 6, we have given a general 
method of linearizing any nonlinear servomechanism, t.e., any nonlinear 
system can be made to behave like a linear system by modifying the 
system into an oscillating-control servomechanism. In the preceding 
chapter, we have presented a method for analyzing a servomechanism 
including a nonlinear device whose behavior is insensitive to the frequency 
of the input. These methods for designing nonlinear servomechanisms 
encompass a wide variety of nonlinear problems in engineering practice 
and are quite sufficient for dealing with the usual systems synthesis 
problems. 

On the other hand, as indicated in the later sections of the last chapter, 
the problem of optimum utilization of the nonlinear characteristics of a 
system to improve the performance of the system is generally a much 
more difficult problem than the problem of design for stability. In fact, 
only a modest beginning has been made in this direction. It is thus not 
possible at the present time to give a satisfactory treatment of the sub- 
ject of general nonlinear servomechanisms. Furthermore, the problem 
should perhaps be formulated more directly and in a different way. 
Instead of finding the performance of an assumed system, we should 
specify the performance of the system, and then determine the required 
nonlinearity. This approach will be discussed in Chap. 14. The scope 
of this chapter is quite limited: we shall indicate only a few possibilities 
of purposefully utilizing the characteristics of a nonlinear system. 

11.1 Nonlinear Feedback Relay Servomechanism. If we confine our- 
selves to cases where the deviations from the equilibrium state are small, 
the results of Bushaw stated in Sec. 10.8 for the optimum switching line 
of a relay servomechanism can be greatly simplified. It is seen from the 
discussions in that section that near the origin the switching line C is 


approximated by 
gly| = —2y (11.1) 


This indicates that an improvement in the performance of the system 
over that of a system with linear switching (Sec. 10.6) can be achieved 
160 
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by using nonlinear switching defined by Eq. (11.1). If xis the input and 
y the output, Eq. (11.1) should be modified into 


aġġ] = (z — y) (11.2) 
sgn (x — y) viz — y| = ay (11.3) 


where a is a constant. 

The block diagram of a relay servomechanism utilizing the switching 
condition of Eq. (11.2) is shown in Fig. 11.1. This method of ġlġl 
feedback was proposed by Uttley and Hammond! for improving the 
performance of the simple relay servomechanism. The computer here 
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is only required to generate the signal a?y|y| and thus can be relatively 
simple. The equivalent switching condition of Eq. (11.3) can also be 
implemented by the ý and sgn (x — y) Viz — y| feedback. Since 
(x — y) is the error, the system may be called sign error root-modulus 
error (SERME) system. The block diagram is shown in Fig. 11.2. Here 
again, the computer is relatively simple. This system was proposed by 
J. C. West.? 

These nonlinear feedback relay servomechanisms, although compara- 
tively simple, cannot be analyzed rationally. In fact, our argument in 
their favor is only a plausible one, not a complete one. The final design 


1A, M. Uttley, P. H. Hammond, “Automatic and Manual Control,” p. 285, edited 
by A. Tustin, Butterworth & Co. (Publishers) Ltd., London, 1952. 

2 J. C. West, “Automatic and Manual Control,” p. 300, edited by A. Tustin, 
Butterworth & Co. (Publishers) Ltd., London, 1952. 
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of any particular system using these principles has to be determined by 
actual experimentation. 

11.2 Systems with Small Nonlinearity. If a system of n degrees of 
freedom is not linear, then instead of a linear differential equation, such as 
Eq. (2.3), we have 


qr- ty 
= be Oy i Bore fb a, + Qoy 


d qm 
T als a ei 4) = x(t) (11.4) 


where the a’s and u are constants, z(t) is the input, y(¢) is the output, and 
f is a nonlinear function of its variables. The first part on the left of 
Eq. (11.4) is thus the linear differential operator, as in Hq. (2.3). All the 
nonlinearity of the system is represented by the last term on the left 
of the equation. Small nonlinearity means that u is small in comparison 
with the a’s. 

For small nonlinearity then, we may try a formal expansion of the 
solution in a power series of p: 

YO = yO + ny + YO + >> (11.5) 

By substituting Eq. (11.5) into Eq. (11.4) and by equating equal powers 
of u, we have 
dry 7-100) 
ae tp 
dry qr—ty) 
a T O a 


dy 
sof a= + ay = (Ð (11.6) 


(1) 
' +a w + ay" 
qr-} (0) 
= -(9°, ee Se) (11.7) 


and other equations for higher-order terms. The “zeroth-order” approx- 
imation is thus the linear equation, as in Eq. (2.8). But more important 
is the fact that the first-order correction term due to nonlinearity is 
determined by Eq. (11.7), an equation of exactly the same character- 
istics as the zeroth-order approximation. In other words, if the linear 
approximation shows that the system is damped and has other desired 
properties of a servo system, then the first-order correction y® (A will 
also have such properties. Moreover, because of the occurrence of the 
small parameter u before y(t) in the expansion of Eq. (11.5), the correc- 
tions for nonlinear effects are small. Consequently, small nonlinearity 
in a system of “satisfactory” performance will not alter essentially the 
system behavior from its linear approximation. Therefore, as far as an 
engineering approximation is concerned, we can treat such systems as 
linear systems. This is the reason that the linear theory of servomech- 


+ 


NONLINEAR SYSTEMS 163 


anisms has had such good success in spite of the ever present small non- 
linearity in even a “linear” system. 

On the other hand, if the damping of the linear approximation is very 
small, then we know that there is the possibility of resonance. That is, 
even if the input z(t) is of order unity, the output y(t) of the linear 
approximate system of Eq. (11.6) can be very much larger than unity. 
gemt 


*. + 3 dint 


When this is the case, the quantity pf (w, ; ) or the non- 


linear effects, can be of the same order of magnitude as some of the linear 
terms, even with small u. In other words, our formal expansion pro- 
cedure of the last paragraph is not justified, and we must expect strong 


Amplitude 


A ` 
-m —y Ao 
(a) (b) 
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Frc. 11.3 


effects from even small nonlinearity if the system has only very weak 
damping. In the following sections, we shall give a brief description 
of the kaleidoscopic behavior of a nonlinear system. Detailed treatment 
of such phenomena in nonlinear mechanics can be found in the excellent 
books by Minorsky! and Stoker.? 

11.3 Jump Phenomenon. As already stated in Sec. 10.2, self-excited 
oscillation in a system, if it is automatically built up from very small 
deviations from the equilibrium state, is called ‘“‘soft” self-excited oscilla- 
tion; if it has to be started by large deviations from the equilibrium state, 
the oscillation is called “hard” self-excited oscillation. In some cases, 
the coefficients of the differential equation of the system may depend 
upon a parameter à of the system. If, for some particular value of A, the 
critical Mo, the character of the equilibrium state changes from that of a 
stable state to an unstable state, then the limit cycle or steady oscillation 
will appear. According to whether the system is one of soft or hard self- 
excitation, the phenomenon occurs as in either Fig. 11.3a or b. In the 


1 N. Minorsky, “Introduction to Nonlinear Mechanics,” Edwards Bros., Inc., Ann 
Arbor, Mich., 1947. 

2 J. J. Stoker, “Nonlinear Vibrations,” Interscience Publishers, Inc., New York, 
1950. 
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first case, if the parameter \ increases, nothing happens until à reaches 
the value Ao, at which point the equilibrium state changes from stability 
to instability with a simultaneous appearance of a stable limit cycle whose 
amplitude begins to increase with à. If \ is made to decrease, the phe- 
nomenon retraces its path exactly, and the limit cycle disappears when 
A =o For a system with hard self-excitation, the picture is differ- 
ent (Fig. 11.3b). The oscillation appears suddenly at A = Ao with a 
finite amplitude, and, with increasing 4, the amplitude increases. If à 
decreases, the oscillation or limit cycle does not disappear when \ = Xo 
but disappears farther along the curve where the amplitude again jumps 
from a finite value to zero. Thus the jump phenomenon is associated 
with the hysteresis of the system behavior. 

11.4 Frequency Demultiplication. If a nonlinear system is acted on 
by a periodic input containing two frequencies w; and we, the output of 
the system occurs not only with these frequencies and their harmonics 
but also with an additional spectrum of the so-called combination tones 
mw, + nw Where m and n are integers. Thus, for example, if a voltage 
£ = X(COS wit + COS wet) is Impressed on a nonlinear conductor whose 
current y is given by y = ay + ax? + azz’, then the output y(t) will 
contain the following frequencies: w1, we, 2w1, 2wa 301, 3wa w1 + we, 
w — we, 2w1 + we, 201 — we, w + 2we, and wı — 2w The first six fre- 
quencies are regular harmonics, but the last six are the combination 
tones resulting from the nonlinearity of the conductor. Some of these 
are higher, and others are lower than the original frequencies w; and ws. 
These lower frequencies are called subharmonics, and the process by 
which they are obtained is called frequency demultiplication. 

It is seen that if wı and we are fairly close together, w: — we can be 
much smaller than either of the original frequencies. If, in addition, 
the system can be stabilized at such a low frequency, subharmonics of 
the order of zg of the input frequency, and even lower, can be obtained. 
If several such systems are connected in series, with the subharmonic 
output of one serving as input to the other, then still lower frequencies 
can be reached. 

11.5 Entrainment of Frequency. If a nonlinear system has a self- 
excited oscillation of frequency wi, then when the system is subjected 
to an input of a slightly different frequency we, we expect the simultane- 
ous occurrence of both w and we, and, through nonlinear interaction, the 
beat frequency wz — 1. In reality the phenomenon develops as shown 
in Fig. 11.4. The frequency we — w; disappears suddenly as soon as 
we reaches a certain zone of synchronization AB. In this interval there 
exists only one frequency we, and everything happens as if the original 
frequency w, were entrained by the variable frequency ws. 

This phenomenon of frequency entrainment was first explained by van 


NONLINEAR SYSTEMS 165 


der Pol and was extended by others. Let the system be one of second 
order and determined by the differential equation 


3 
We CH +y a + wiy = Bo? sin wot (11.8) 


where a, y, and B are positive constants. If B = 0, the system has 
negative damping for small amplitudes of oscillation, but positive damp- 
ing for large amplitudes of oscillation. Thus there is an amplitude at 
which the system can maintain a 
steady oscillation. Furthermore, if 
both œ and y are small, then steady 
self-excited oscillation must occur at 
a frequency close to wi Van der 
Pol showed that when we is close to 
w1, the solution of Eq. (11.8) can be 
written as 


y(t) = b(t) sin wet + ba(t) cos wat 
(11.9) 


Fre. 11.4 


and that 6,(¢) and ba(t) are slowly varying functions of t, such that, for 


example, 


a K 


di web (t) 


By substituting Eq. (11.9) into Eq. (11.8) and retaining only the terms 
up to first order, we can write the equations for bı and bz as 


Di = f'1(b1,b2jw2) 
(11.10) 
a fo(b1,b2jw2) 


These are autonomous equations of the first order. Therefore they can 
be solved by the method of isoclines in the same manner as a second- 
order system in the phase plane. Analysis shows that for a certain range 
of we near w, the system of Eq. (11.10) has a stable node point in the 
bib, plane. Thus, no matter what the initial values of bı and be are, 
the system tends to a fixed set of values of b, and bz corresponding to 
this point. Thus no oscillations of the frequency w: — wı appear at all. 
For we outside this range, there is a stable limit cycle in the bib, plane 
which accounts for the observed effect shown in Fig 11.4. 

11.6 Asynchronous Excitation and Quenching. In some nonlinear 
systems it is possible either to start or to stop an oscillation of frequency 
w by means of another oscillation of an entirely different frequency w. 
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In the first case, the phenomenon is called asynchronous excitation, and 
in the second, asynchronous quenching. An understanding of these 
phenomena can be obtained by recalling the fact that the mere existence 
of a limit cycle in the phase space of a system does not necessarily mean 
the occurrence of steady oscillation. For steady oscillations to take 
place, the limit cycle must be stable in the sense that the system has a 
tendency to return to the limit cycle, even if the system is disturbed 
and displaced to a point away from the limit cycle in the phase space. 
An unstable limit cycle cannot be realized in a physical system. In 
view of this, one can readily see that the appearance of a new oscillation 

may, under certain circumstances, either create or destroy 
ears cost the conditions of stability of another. In the first case we 

have the phenomenon of asynchronous excitation and in the 

second that of asynchronous quenching. The term asyn- 

chronous merely emphasizes the fact that the relation 

i between w and w is purely arbitrary. 

11.7 Parametric Excitation and Damping. It has been 
known for a long time that if a parameter of an oscillating 
system is varied periodically with a frequency w, the system 
begins to oscillate at the frequency w/2. Lord Rayleigh 
demonstrated this effect by attaching one end of a stretched 
wire to a prong of a tuning fork. If the fork oscillates with frequency 
w, lateral oscillations of the wire appear with frequency w/2. A similar 
problem is that of a simple pendulum, a mass supported by a weightless 
bar, under the influence of a sinusoidal force applied at the end of the bar 
(Fig. 11.5). If @ is the small angular displacement of the pendulum from 
the vertical, and, without loss of generality, if the frequency of the sinus- 
oidal force is unity, the differential equation for @ is then 


mg 
Fie. 11.5 


26 
ml z + (mg + a cos t)6 = 0 


where m is the mass, g the gravitational attraction, | the length of the 
pendulum, and a the amplitude of the periodic force. This equation 
can be written as 

d*6 

dt? 
a is thus equal to g/l and 6 is equal to a/ml. The case of an inverted 
pendulum, with the mass above the point of support, can also be repre- 
sented by Eq. (11.11) by taking g as negative. Therefore, for a normal 
pendulum a is positive, and for an inverted pendulum a is negative. 
Equation (11.11) is actually a linear equation but with periodic variation 
in the end force applied to the pendulum. Thus the system can be 
considered as one with periodic variation of its parameter. 


+ (a + B cos tjé = 0 (11.11) 
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Equation (11.11) is the well-known Mathieu differential equation. 
The stability of the solution is determined by the constants a and 8. In 
fact, the a8 plane can be divided into a region of stability and a region 
of instability as shown in Fig. 11.6 (where the stable region is shaded). 
It is thus seen that for positive a, the case of the normal pendulum, the 
system is stable when the periodic end force is absent, or 8 = 0. This is, 
of course, well known. However, it is interesting to note that with 
appropriate 8, the system can be made unstable. Then the pendulum 
will swing with increasing amplitude until, finally, nonlinear effects limit 
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the amplitude at a fixed large value. This is the phenomenon of para- 
metric excitation. For negative values of a, the case of inverted pendu- 
lum, the system without a periodic end force is naturally unstable. But 
for a certain narrow range of 8, the system can be stabilized. This phe- 
nomenon is called parametric damping. 

Parametric excitation or parametric damping can occur in any system 
with periodic variation in the parameter of the system. This phenome- 
non or any of the previously shown nonlinear phenomena can be utilized 
to achieve the desired performance of a control system. In fact, many 
of them are already incorporated into the many components of a servo con- 
trol system. However, these nonlinear components are mere “gadgets,” 
and they are “designed” rather by experience and testing than by theo- 
retical analysis. The application of the characteristically nonlinear 
phenomena to the over-all design of a control system is as yet an unex- 
plored field. Our discussion in the preceding sections serves only as an 
indication of its rich possibilities. 


CHAPTER 12 


LINEAR SYSTEM WITH VARIABLE COEFFICIENTS 


The only system with time-varying coefficients considered in detail 
in the previous chapters is the pendulum with a periodic force at the 
supporting end, discussed in connection with the phenomena of para- 
metric excitation and damping. All other systems considered do not 
have coefficients of their differential equations that are explicitly func- 
tions of time. We have, however, shown in Chap. 1 that linear systems 
with time-varying coefficients can have behavior entirely different from 
that of systems with constant coefficients. In this chapter, we shall again 
take up this question and discuss in some detail such a typical but simple 
system: the short-range artillery rocket. We shall demonstrate that the 
question of stability of such a system with variable coefficients cannot be 
solved in the same manner as for the linear system with constant coeff- 
cients. Not only is the method of Laplace transform and transfer func- 
tion useless for the purpose, but we are forced to change our entire 
approach to the problem. 

We shall study the motion of a fin-stabilized artillery rocket during 
the period of action of the rocket thrust. We shall be particularly 
concerned about the angular deviations of the rocket axis from the launch- 
ing angle caused by disturbances during the launching and the subse- 
quent damping action of the fins. The general problem of the dynamics 
of artillery rockets was studied in great detail by various authors in 
different countries during World War II. The American work is sum- 
marized by Rosser, Newton, and Gross.1 The work done in England 
is reported by Rankin.? Carriére’s paper? represents a French investi- 
gation of the same subject. Our discussion here will be greatly simplified 
and has the purpose of bringing out only the salient points of interest 
to our study of linear systems with variable coefficients. 

12.1 Artillery Rocket during Burning. For a fin-stabilized artillery 
rocket, the interaction of motion in the vertical plane and the horizontal 
plane is negligible, t.e., there are negligible aerodynamic forces in the 


1 J. B. Rosser, R. R. Newton, and G. L. Gross, “Mathematical Theory of Rocket 
Flight,” McGraw-Hill Book Company, Inc., New York, 1947. 
? R. A. Rankin, Trans. Roy. Soc. London (A), 241, 457-585, (1949). 
3 P. Carrière, Mém. artillerie franç., 25, 253-360 (1951). 
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vertical plane produced by the motion in the horizontal plane. There- 
fore the characteristics of the rocket are not lost by confining our con- 
siderations to the vertical plane and studying the motion in that plane 
only. We shall consider the earth to be flat for short-range artillery 
rockets. Let v be the magnitude of the velocity of the rocket, @ the 
inclination of the velocity vector, and ¢ the inclination of the axis of the 
rocket (Fig. 12.1). Then the angle of attack a of the rocket is 


a=o-4 (12.1) 


Let m be the mass of the rocket, and g the gravitational constant. The 
gravitational attraction is thus a vertically downward force mg. The 


Fie. 12.1 


aerodynamic forces are the lift L, the drag D, perpendicular and parallel 
to the direction of motion, respectively, and the moment M. All these 
forces act on the center of gravity of the rocket. The constant thrust 
S of the rocket motor may have an angular misalignment 8 and a moment 
arm 6 with respect to the center of gravity, as indicated in Fig. 12.1. 
Then the equation for acceleration along the trajectory is 


m? = 8 cos (a — B) — mg sin 0 — D (12.2) 
The equation for acceleration normal to the trajectory is 
moo = Ssin (a — 8) — mg cos ð + L (12.3) 


Finally, if k is the radius of gyration of the rocket about a lateral axis 
through the center of gravity, the equation of angular acceleration is 


me$ = 86 — M (12.4) 


In Eq. (12.4), we have neglected the so-called jet damping moment since 
it is small in comparison to the restoring moment of the tail fins. 
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The aerodynamic forces and the moment are dependent upon the 
angle of attack a. But if the fins on the rocket are not properly aligned, 
the lift and moment will not vanish even if a is zero. The misalign- 
ment can be accounted for by introducing a misalignment angle y, such 
that L and M vanish not ata = 0 but at a = y. If pis the density of 
air and d the diameter of the rocket body, we can introduce the lift 
coefficient Kz, the drag coefficient An, and the moment coefficient Kx 
as follows: 


L = Kzrpv?d? sin (a — y) (12.5) 
D = Kypo°d? (12.6) 
M = K upv’°d? sin (a Bie y) (12.7) 


For short-range artillery rockets, the summit of the trajectory is not high; 
therefore the density p can be taken as a constant. Furthermore, the 
maximum velocity is small enough so that all the coefficients Kz, Kp, 
and Ky can be considered as constants, not influenced by the Mach 
number on the trajectory. Moreover, for short-range rockets, the pro- 
pellant fraction of the total mass is small; then the mass m of the rocket 
can be taken to be a constant without serious error. Equations (12.1) 
to (12.7), together with these simplifying assumptions, then determine 
the trajectory of the rocket. 

The burning time of this type of rockets is very short, say $ of a 
second. This necessarily makes the acceleration S/m very large. In 
fact, the thrust S is so large that the gravitational and drag forces are 
negligibly small in comparison. Furthermore, the deviation of thrust 
line from the flight direction, a — £, is never large. Therefore an imme- 
diate zeroth-order approximation of Eq. (12.2) is 


Therefore the zeroth-order approximation to the trajectory is given by a 
straight line with the initial inclination 6; (Fig. 12.1). The motion along 
this straight line is one of constant acceleration S/m. Thus if z is the 
distance measured along this line, then the motion is represented by 


2 
jie gs (5) ? (12.9) 


m m 
If the rocket is launched without any initial velocity, then z is the true 
distance from the launching point. If there is initial velocity, z is not 


the distance measured from the launching point, but from some point 
ahead of it. From Eq. (12.9) we have 


d_ dd d_ |B a 
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With this zeroth-order solution, we can calculate the first-order solution 
of the rocket trajectory. This will be done presently. 

12.2 Linearized Trajectory Equations. Since the deviation of the 
trajectory from the zeroth-order solution during burning is never large, 
we can replace the velocity v and the time derivative whenever they 
occur in Eqs. (12.3) and (12.4) by those given by Eqs. (12.9) and (12.10). . 
Furthermore, since a — 6 is small, sin (a — £) can be replaced by a — 8. 
cos 6 can be replaced by cos 6;. We shall also neglect the lift L, since 
it is small in comparison to the lateral thrust component and the weight 
of the rocket. With these simplifications, Eqs. (12.3) and (12.4) become 


22 = (a — B) — m$ cos 6; (12.11) 
and 
d? d ò 8r? 
2 s = 5-5 ela n) (12.12) 
where o is defined by 
2 Ag? k?°m 
o? = Ar ETL (12.13) 


and evidently has the dimension of a length. o can be taken as the 
characteristic length of the disturbed motion of the rocket, and may be 
considered as the wave length of the disturbed trajectory. Equations 
(12.1), (12.11), and (12.12) are then linearized equations for the three 
unknowns a, 6, and ¢. Linearization is made under the assumption of 
small deviations from the straight-line trajectory at the ideal launching 
angle 6;. 

We can eliminate 6 and ¢ to obtain a single equation for a To do 
this, we divide Eq. (12.11) by 2 +/z and differentiate the resultant equa- 
tion with respect to z. Then we have 


a6 dé | da i gm 
Vi T via ie KTA 
Now we divide Eq. (12.12) by 2+/z and subtract from it the above 


equation. Then, using Eq. (13.1), ¥ we have 


la 1 da 4g? ~/2 1 _ 6 
Vig tet (TP a 


This equation clearly demonstrates the fact that the controlling differ- 
ential equation of the stability of an artillery rocket is not an equation 
of constant coefficients. In fact, this equation can be put into the 
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standard form of Bessel’s equation by introducing the nondimensional 
distance & defined by 


ieee (12.14) 


where o is the “wave length” specified by Eq. (12.13). Then we have 


da , lda ls (2): 
atest p Janrt inl) ¥ 
1 gm 1 
~ 4 (3+ 2 cos 6) 4 (12.15) 


When a is determined, 6 can be calculated by integrating the following 
equation, obtained from Eq. (12.11): 


. = ae —B- “a COS | (12.16) 


The independent variable z or & in the differential equations is not a 
time variable but a distance variable. However, since, as shown by 
Eq. (12.9), z and thus £ are monotonic functions of t, the stability of the 
system is not modified by changing the independent variable from ż to £; 
if the system is stable in terms of £ it is stable in terms of ¢, in the sense 
that the deviations from the ideal straight-line trajectory will decrease 
with increase in é or t. Therefore the problem of stability can be ade- 
quately discussed with the variable & ¢ increases from the initial value 
£,at¢ = 0 as time increases. The initial value & is zero if the launching 
velocity is zero. 

12.3 Stability of an Artillery Rocket. To discuss the question of sta- 
bility, we must solve Eqs. (12.15) and (12.16) with the specified initial 
conditions and then determine whether the angle of attack a or, more 
appropriate, the deviation of the inclination of the trajectory, 6 — 6; 
tends to zero as £ increases. Equation (12.15) is actually Bessel’s equa- 
tion for the order 4. The complementary functions are thus Bessel 
functions of order 4 and — 3. They are, however, expressible in terms 
of elementary functions. In fact, Eq. (12.15) can be rewritten as 


ang 
dé += RG) (12.17) 
where 

f= Vea (12.18) 


and 


Q) = rVit (en) a (8+ g cos a)y (12.19) 


Therefore the complementary functions for ¢ are simply sin £ and cos £. 


VARIABLE-COEFFICIENT LINEAR SYSTEMS 173 


The conditions of the rocket as it leaves the launcher, or the initial 
conditions, are 


Vv = Vo 

6 = A 

re, (12.20) 
and do/dt = (d¢/dt), | 


The subscript » thus indicates quantities at the instant t= 0. The 
initial values of £ and ¢ are thus 


2 2 
ty = = aah = TR (12.21) 
and 
fo = V ko ao (12.22) 


By using Eq. (12.16), we have at t = 0 


d6\ _ a — Ê — (mg/S) cos 6; 
Vie (ae), 2 / to 


But 6 = ¢ — a, thus 


doy i t  _(&\ _ dé\ , 8+ (gm/S) cos 6; 
ve (z), ý 2 V Eo j (3), Vi 3) 4 2 ~/ ko 


or explicitly, then 


dN _ pp alde/dt)o _ B + (gm/S) cos 8; 
( ie) Vo a F N (12.23) 


With the initial conditions so translated, we can write down the solu- 
tion of Eq. (12.17) or ¢ or a immediately, 


Ge 7 cos (E — £) E = Í | sin (n — £1) Ql) in| 
1 
Jt 


where Q is the forcing function specified by Eq. (12.19). Since Q(») 
contains half powers of n, the integrals of Eq. (12.24) are actually Fresnel 
integrals, When ais calculated, Eq. (12.16) then gives @ by quadrature, 


peen (ie) a Í sR in| (12.24) 
0 Eo 


0 — 6; = (bo — b) — L(6 -+ y cos a) log -È + A as dy [(12.25) 


We can further separate the effects of different disturbances at the 
launcher by writing Eq. (12.25) in a series of terms, each representing 
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one type of disturbance. Thus 


E E S (s + = cos a) Galé, Eo) + 5a Galé, £0) 


g 


— y[Gi(E,£o) + Gs(E,£o)] + aoGs(E, o) + 52 a GalE to) (12.26) 


TVG 


The first term represents the effect of the initial deviation of the trajectory 
angle. The second term gives the effect of thrust misalignment and 


Gili, čo) 
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gravitational pull. The third term shows the influence of the turning 
moment of the thrust. The fourth term gives the effect of fin misalign- 
ment. The fifth term represents the effect of the initial angle of attack. 
The last term gives the influence of the initial angular velocity of the 
rocket. Each of the G’s is a function of the two variables ¢ and £a, and 
is a combination of Fresnel’s integrals. Rosser and his collaborators call 
them the “rocket functions” and tabulate them in their book. Figures 
12.2 to 12.5, representing these functions graphically, are also taken 
from this book. 

An inspection of Figs. 12.2 to 12.5 immediately brings out the fact 
that all the G functions have persistent values for large & Thus the 
disturbances do not damp out. The first and the last two terms of 
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Eq. (12.26) represent the effects of the initial disturbances at the launch- 
ing point. They have finite nonvanishing values at large & The other 
terms in Eq. (12.26) are the “output” due to “input,” or the forcing 
functions. They also have values for large & The behavior of the G, 
function is even worse: at large ¢, it is approximately log £, and thus 
increases without limit. Therefore, if we use the previously established 
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criterion for stability of a system, t.e., the vanishing of initial disturbances 
and the boundedness of the output under “reasonable” forcing func- 
tions, then the artillery rocket is unstable. On the other hand, the 
complementary functions of the basic equation, Eq. (12.15), are Bessel 
functions which vanish for large values of the variable and thus seem 
to indicate stability of the system. If we try to apply our experience 
with systems which are described by a linear differential equation with 
constant coefficients, the behavior of a system with variable coefficients 
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apparently is confusing. However, this only indicates the inapplicability 
of concepts developed for systems with constant coefficients to systems 
with time-varying coefficients. A new approach to the problem of stabil- 
ity and control is required. A discussion on this point will be presented 
in the following section. 
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12.4 Stability and Control of Systems with Variable Coefficients. 
For a linear system with constant coefficients, our previous study has 
shown that the satisfactory performance of the system with respect to 
stability and other control criteria can be guaranteed if the solutions 
of the homogeneous equation without forcing function or input are all 
sufficiently damped. Therefore, although the forcing function or input 
may vary considerably from one case to another, the performance cri- 
teria and thus the design of the system are based upon the study of the 
solutions of the homogeneous equation or the complementary functions 
of the complete equation. This is the fundamental principle in the theory 
of servomechanisms. The actual method of analysis by the technique of 
transfer functions based upon the Laplace transform is merely a useful 
trick. In principle, for instance, the classical methods for finding the 
complementary functions are just as good as the root-locus method of 
Evans. 

Our preceding discussions in this chapter definitely showed that for 
a linear system with time-varying coefficients, the fact that all the 
complementary functions of the equation are damped is no guarantee 
for the satisfactory performance of the system with forcing functions. 
With some input, or forcing function, the output may even be unbounded, 
in spite of the damped complementary functions. Then the question of 
satisfactory performance of the system cannot be answered without 
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knowing the input function. This requirement of specifying the input, 
together with the difficulty of actually determining the solution of a non- 
homogeneous differential equation with variable coefficients, seems to 
make the task of developing a general theory of stability and control for 
such systems a hopeless one. However, we must distinguish the compu- 
tational difficulties from the real difficulty in organizing a general theory. 
Computational difficulties can be removed by fast electromechanical 
computers and should not be considered real difficulties. When this is 
realized, we see that specifying the input function for performance 
analysis is in fact designing for a specified purpose: we must know first 
what we want from the system under what circumstances, before we can 
design the system. When this approach is adopted, the general problem 
of stability is suppressed, because, if the system is designed to have a 
specified satisfactory performance, that in itself is sufficient. For linear 
systems with time-varying coefficients, a general theory of control design 
can be formulated on this basis. This is an application of the classical 
ballistic perturbation theory and will be discussed in the following chap- 
ter. In retrospect, we may say that the theory of conventional servo- 
mechanisms is a theory for general design of a specific type of systems. 
The perturbation theory of the following chapter is a theory of specific 
design of a more general type of systems. 


CHAPTER 13 


CONTROL DESIGN BY PERTURBATION THEORY 


The object of the ballistic perturbation theory is to calculate the behavior 
of a projectile near the so-called normal trajectory. The normal tra- 
jectory is a certain trajectory with specified initial conditions, propul- 
sion program, atmospheric conditions, and programed lift and drag. If 
the actual conditions are slightly different from these specified conditions 
or if the vehicle is disturbed from its normal trajectory by accidental 
wind gusts, the trajectory of the projectile will be different from the 
normal trajectory. But if such disturbance influences are small, the 
perturbed trajectory will lie in the neighborhood of the normal trajectory, 
and the difference of the perturbed trajectory and the normal trajectory 
will remain small. This fact of nearness to a calculated, known tra- 
jectory is the basis for the linearization of the differential equations of 
motion for the perturbed trajectory. The perturbed system is then 
represented as a linear system with time-varying coefficients, time vary- 
ing because of the varying conditions of the projectile with respect to 
time. 

The original purpose of ballistic perturbation theory was to calculate 
the small modification of the trajectory of a projectile due to deviations 
of the weight of the shell from standard value, to changes of atmospheric 
conditions, to effects of wind, etc. However, with the advent of modern 
large and fast computing machines, the tendency was to calculate all 
neighboring trajectories separately. The usefulness of ballistic pertur- 
bation theory then vanishes. However, the problem of designing the 
control for linear systems with variable coefficients is just the problem 
to which the ballistic perturbation theory can be applied. We shall 
show this in this chapter by studying the control problem of a long-range 
rocket vehicle. This particular problem was studied by Drenick.! Our 
treatment is, however, more complete and includes the problem of the 
automatic guidance of such vehicles.? 

13.1 Equations of Motion of a Rocket. In order not to complicate 
matters, the vehicle is assumed to move in the equatorial plane of the 

1 R. Drenick, J. Franklin Inst., 25, 423-436 (1951). 

2 Cf. H. S. Tsien, T. C. Adamson, E. L. Knuth, J. Am. Rocket Soc., 22, 192-199 
(1952). 

178 


CONTROL DESIGN BY PERTURBATION THEORY 179 


rotating earth, as sketched in Fig. 13.1. The planar motion is possible 
due to the absence of the cross Coriolis force in the equatorial plane. The 
coordinate system is fixed with respect to the rotating earth, ie., it 
actually rotates with the angular velocity Q, the speed of earth rotation. 
In the equatorial plane, the position of the vehicle at any time instant t 
is specified by the radius r and the angle @ from the starting point of the 


Elevator 


n A r, ý Flight Path 


Destination » 
7 
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vehicle. 79 is the radius of the earth. g is the gravitational constant 
at the surface of the earth without the centrifugal force due to rotation. 
Let R and O be the force per unit mass due to thrust and aerodynamic 
forces acting on the vehicle in the radial and the circumferential direc- 
tions, respectively. Then the equations of motion of the center of grav- 
ity of the vehicle are 


dr 

n? 

a 5 
at R+ + 0)*—9(") | 

r = 0 —24(6 + 0) 


where the plus sign in the second terms on the right will be valid for 
flights toward the east, and the minus sign for flights toward the west. 
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The forces R and © are composed of the thrust S, the lift L, and the 
drag D. Let W be the instantaneous weight of the vehicle with respect 
to g, the gravitational constant, and V the magnitude of air velocity 
relative to the vehicle. Then it is convenient to introduce the parameters 
2, A, and A, defined as 


_ Sg _ Ig _ Dg 


It will be assumed that the natural wind velocity w is in the horizontal 
direction and in the equatorial plane, with positive sign if it is a head 
wind. w is considered to be a function of altitude r. If v, 1s the radial 
velocity and va the circumferential velocity, t.e., 


Vp = f 
eee | (13.3) 


then the relative air velocity V is computed as 
Vi = +4 (6 + w)? (13.4) 


If 6 is the angle between the thrust line and the horizontal direction, then 
the radial and the circumferential components of the forces R and © per 
unit mass are 


R = Z sin 8 + (w + wA — vA (13.5) 

© = 2 cos 8 — vA — (va + w)A i 

If N is the moment of forces about the center of gravity, divided 

by the moment of inertia of the vehicle, the equation for the angular 
acceleration is 

dg dé 

a a +N (13.6) 

To specify completely the motion of the vehicle, the lift L, the drag D, 

and the moment m about the center of gravity have to be given as 

functions of time. According to aerodynamic convention, L and D will 

be expressed in terms of the lift coefficient Cz and the drag coefficient Cp 

as follows: 
L = pV? AC: 
D = spV?AC>d 


where p is the air density, a function of the altitude r, and A is a fixed 
reference area, say the wing area of the yehicle. In the present problem, 


(13.7) 
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since the motion of the vehicle is restricted to the equatorial plane, the 
attitude of the vehicle, essential for aerodynamic calculations, is deter- 
mined by the angle of attack a, that is, the angle between the thrust 
line and the relative air velocity vector (Fig. 13.1). The control on the 
motion of the vehicle is effected, however, through the elevator angle y. 
The parameters which will affect Cz and Cp are thus a and y. In addi- 
tion, the aerodynamic coefficients are functions of the Reynolds number 
Re and the Mach number M. If ais the velocity of sound in the atmos- 
phere, a function of the altitude r, the Mach number is 


M = (13.8) 


If l is a typical length of the vehicle and x is the viscosity of air, again 
a function of the altitude r, the Reynolds number is . 


Re = on (13.9) 
Therefore 
Cz = Cx(a,y,M Re) 
Co = Cr(a,y,M,Re) (13.10) 


We shall assume that the thrust line passes through the center of 
gravity of the vehicle; thus the thrust gives no moment. Since the 
angular motion of the vehicle during the powered flight is expected to be 
slow, the jet damping moment of the rocket is negligible. The only 
moment acting on the vehicle is then the aerodynamic moment m. m can 
also be expressed as a coefficient Cx as follows: 


m = pV? AlC x (13.11) 


The moment coefficient Cu is again a function of the four variables a, y, 
M, and Re, or 
Cu = Cu(a,y,M,Re) (13.12) 


If I is the instantaneous lateral moment of inertia of the vehicle about 
the center of gravity of the vehicle, then the magnitude of N in Eq. 
(13.6) is 


m 
N = T (13.13) 


With the notations defined above, the system of equations of motion 
is as follows: 
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dr _ 
= 
db ve 
d r 
dp 
7 = 
Wr zing (toara tra) ~o( ) 
di f roo (13.14) 
= F 
oe Ecos 8 — vA — os + wa — 20,(% 4 a) + | 


Va 


a i x cos Ê — v, - (+ wa = 20, (2 3 a)! +N 
= H 


This system of equations is a set of first-order equations for the six 
unknowns r, 6, 8, ve, vs, and 8. To solve it, the six initial values at the 
start, when ¢ = 0, for the unknowns must be specified. In addition, the 
thrust S, the weight W, and the moment of inertia 7 must be given for 
every time instant. To determine the aerodynamic forces the elevator 
angle y must be specified as a function of time. The properties of the 
atmosphere must be known; i.e., the wind velocity w, air density p, air 
viscosity u, and sound velocity a must be given as functions of the alti- 
tude r. The angle of attack «œ of the vehicle cannot be specified; it is 
a quantity to be computed from the angle 8 and the relative air velocity 
vector V. 

Let the properties of the atmosphere be standardized, and the average 
characteristics of the vehicle and its power plant be taken to be repre- 
sentative. Then if the elevator angle y is specified as a function of time, 
the flight path of the vehicle is determined and can be calculated by 
integrating the system of Eq. (13.14). The actual execution of this 
computation will probably be done on an electromechanical computer. 
This flight path of a standardized vehicle in standard atmosphere can be 
called the normal flight path. 

The dominating feature of the normal flight path is its range. This 
range is the distance between the take-off point and the landing point. 
The problem of navigation is then to calculate the proper time for cutoff 
of the rocket motor and the proper variation of the elevator angle during 
flight so that the resultant range is that desired. This problem of navi- 
gation for the standardized vehicle in the standard atmosphere can be 
solved mathematically before the actual take-off of the vehicle, since all 
information for the normal flight path is known or specified beforehand. 
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13.2 Perturbation Equations. Natural atmospheric characteristics 
do not necessarily coincide with those specified for the standard atmos- 
phere. The wind velocity at each altitude changes according to the 
weather conditions; the temperature T is also a varying quantity. 
Therefore one should expect variations from the normal flight path due 
to changes in atmospheric conditions. The actual vehicle also may be 
somewhat different from the standardized vehicle in weight, in rocket 
motor performance, etc. Therefore the actual flight path will be differ- 
ent from the normal flight path if the elevator angle program of the 
normal path is used. The problem of navigation of an actual vehicle 
is that of correcting the elevator angle program so that the range of the 
actual flight will be the same as the normal flight path and the destination 
is reached without error. Because of the rapidity of flight, this navi- 
gational problem cannot be solved by the conventional method, which 
neglects completely the dynamic effects and is based upon only kine- 
matical considerations. But instead, the problem should be solved by 
an automatic computing system, which responds to every deviation from 
the normal conditions with a speed approaching instant action. The 
problem is thus more appropriately called the guidance problem, and the 
control system, the guidance system. 

The general problem of guidance is very difficult indeed. However, 
the deviations from the normal conditions are expected to be small, since 
the normal flight path is, after all, a good representation of the average 
situation. This fact immediately suggests that only first-order quantities 
in deviations need be considered. This “linearization ” is the basis of the 
ballistic perturbation theory. The resultant system will have coefficients 
that are evaluated on the normal flight path and are generally functions 
of time. Therefore the fundamental equations in the ballistic perturba- 
tion theory are linear equations with time-varying coefficients. Our 
discussion of the guidance problem of a long-range rocket vehicle is 
thus an example of control design for such systems. The particular 
design specification here is the vanishing of range error. The controlled 
“input” here is the elevator angle corrections. We shall develop these 
concepts in the following discussion. 

Let the quantities of the normal flight path be denoted by a bar over 
them, and deviations, by the ô sign. Thus for the actual flight path, 


r= Ft or 6=6+ 56 B =B + ô 
r= Ü, + bv, v9 = Üe + ive =B + 46 
The deviations of the actual atmosphere from the standard atmosphere 


are expressed as the deviation of density ôp, the deviation of temperature 
ôT, and the deviation of wind velocity ôw; thus 


p=pto T=T+5T w=ŭġ+ bw (13.16) 


(13.15) 
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If we assume no change in the composition of the atmosphere at any 
altitude from that of the standard atmosphere, the knowledge of 5p and 
ôT is sufficient to calculate the deviation of pressure, if necessary. The 
deviation of the actual vehicle from the normal vehicle is assumed to be 
limited only to the deviation of weight ôW and the deviation of moment 
of inertia 62. That is, 


W=W+sw J=I4+8I (13.17) 


The thrust S is assumed to be fixed at the standard value. Thewing 
area A and the aerodynamic characteristics of the vehicle, as expressed 
by Eqs. (13.10) and (13.12), are also assumed to be invariant. 

Substituting Eqs. (13.15) to (18.17) into Eq. (13.14) and then sub- 
tracting the corresponding equation for the normal flight path, we have 
the following equations, according to our linearizing principle: 


d or 
q 7 
dôd Ü I 
a = on + 5 Ove (13.18) 
dsp: 
H ® 
d bv, 
7 = d1 ôr + aa 68 + as bv, + da va + as ôy + ds dp 
+ a7 ôT + as bw + dy ôW 
dè 
La = by r + be 5B + ba vs + ba Sup + bs by + bo dp (13.19) 
| + by ôT + bs bw + by ôW 
LIE L o, r + cz ÒB + Ca by + Ca Bva + 05 By + Cs bp 


+ cr ôT + cs ôw + ca ôW + Cio ôl 


The coefficient a’s, b’s, and c’s are partial derivatives of the F, G, and H 
defined by Eq. (18.14), evaluated on the normal flight path. For 
example, 


oF _ 78 _ Tar 
3 as = 3) as = (Fe) (13.20) 
i \ar Vw oo | OW 


The details of this calculation of coefficients are given in the appendix to 
this chapter. 
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Equations (13.18) and (13.19) are linear equations with variable coeffi- 
cients for the six deviation quantities. If the deviations of the atmos- 
pheric properties dp, ôT, and ôw are known, and if dy, ôW, and ôI are 
specified, then this system of differential equations determines ôr, 66, 68, 
$v, 6ve, and 66. The problem of guidance is, however, different from this. 
What is required is the function ôy, or the correction to the elevator angle, 
such that the range erroris zero. Assuggested by Drenick, this guidance 
problem can best be solved by the method of adjoint functions of Bliss.} 

13.3 Adjoint Functions. The principle of the method of adjoint func- 
tions is as follows. Let y,(¢), where è = 1, ... , n, be determined by a 
system of n linear equations 


n 


dy: — X at = Y,(t) for 2 = 1, s.. 8 n (13.21) 
j=l 


where a,; are given coefficients which may be functions of the time t. 
The Y;(t) are the “forcing” functions or inputs. Now introduce a new 
set of functions \,(#), wherez = 1,... , n, called the adjoint functions 
to y,(t), which satisfy the following system of homogeneous equations 


Bs Y aids = 0 t=1,..., A (13.22) 


By multiplying Eq. (13.21) by A; and Eq. (13.22) by y; and summing the 
equations over 7, we have 


di D Ayi — y V aa (Gipsy; — Aayi) = D Y; 


i=] J=] = 


The two parts in the double sum evidently cancel each other, and we have 


d 
t=] t=1 


This equation can be integrated from the instant £ = ¢, to the instant 


t = te, and : : 
z NY a + | . © dui) dt (13.24) 
i=l + d=] 


Bliss calls this equation the fundamental formula. 


1G, A. Bliss, “Mathematics for Exterior Ballistics,” John Wiley & Sons, Inc., New 
York, 1944. 


n 
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For the problem of long-range rocket, the y; are the perturbation 
quantities, that is, n = 6 and 


ger Yee. Age ne (13.25) 
Yy = 60, y5 = Ov¢ Yi = 68 


Then, according to Eq. (18.19), the adjoint functions \,(¢) satisfy the 
following system of equations 


_ M sis mle + Qa + bids + Ce 
dt F 
Gre 
2E i 0 
a Pi = aAa + beds F Code 
a (18.26) 
j T = M + aada + Dsd5 + Cade 
2 = ty, + dads + dads + Care 
t f 
De _ 
— = i 
The inputs Y; are 
Y, = Yo = Y3 = 0 (13.27) 
Y, = ds ôy + as 6p + a7 ôT + ag ôw + as ôW 
Y; = bs oy + be 6p + by ôT -+ bs ôW + by ôW (13.28) 


Ya = cs ôy + Cs dp + cr ôT + cg ôw + es ôW + Cy OL 


13.4 Range Correction. Equation (13.26) does not determine the i 
functions completely. To do that, a set of values for A must be specified 
at a certain instant. What values to pick for à at what instant depends 
upon the specific purpose of the control design. For our guidance 
problem, we require zero range error. Therefore the quantity of interest 
is ôf at the instant of landing, or 66. if the subscript » denotes quantities at 
the instant of landing. This is sufficient to determine the \’s completely. 
We shall see this presently. 

If ż is the time instant of landing of the actual vehicle and t the time 
instant of landing of the normal flight path, then 


bo = l2 -+ Obe (13.29) 
Similarly 
T2 = f2 + Ore 
9a = Be + 06s (13.80) 
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ôr = (Dp) tate lg + (r)i, 


1 13.31 
ôb = F (De) tnd, Slo + (60)12, l ) 


However, ôr is by definition zero, because landing means contact with 
the surface of earth, or re = fa = rẹ By eliminating ôt from Eq. 


(13.31), 
565 = |- + (®) or + J (13.32) 
r Vy i=tg 
Therefore, if the magnitudes of \,; at the landing instant t = ft, are 
specified as 
ee (p) m (13.33) 
As = M = Ap = Ap = 


then the error in range is given by 


n 
605 = > Niyi 


i=l 


i 
| 


t=fe 


= [hr Or + Ae 60 + 3 88 + Ag Od, + As va + As SB]Wz, (13-34) 


When the normal flight path is determined, the coefficients in Eq. 
(13.27) are specified as functions of time. These equations together 


Actual Path 
(80 rot, 
A (U) 27, 
Normal Flight NX -5 
Path | (Ur deet, 
a (87), 27, 
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Fie. 13.2 


with the end conditions of Eq. (13.33) then determine the adjoint func- 
tions \;. The integration has to be performed “backwards” for t < k, 
perhaps by an electromechanical computer. With the adjoint functions 
so determined, one can use the fundamental formula of Eq. (13.25) to 
modify the equation for the range error given by Eq. (13.34): Let & 
denote the time instant for the power cutoff for the normal flight path. 
Then the condition for the error in range 662 to be zero can be expressed as 
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802 = O = [Aa ôr + do 00 + As 68 H As dv, + As B09 + Ao BB) 
+ i. [MF a + As Ys + AcYe) dt (13.35) 


This is the basic equation for guidance. It will be exploited in the 
following sections. 

13.5 Cutoff Condition. The condition of Eq. (13.35) for arbitrary 
disturbances can be broken down into two parts; the sum and the integral 
are set equal to zero separately. Therefore the condition to be satisfied 
at the normal cutoff instant į is 


Ai ôr + Aa 60 + Az ôB + Ag bu, + As v9 + Ae ôb] = 0 (18.86) 


Since the normal cutoff instant fı is a standard time instant, but not 
necessarily the actual cutoff instant ti, t.e., 


= h + ôt (13.37) 


Eq. (13.36) should be converted into a more useful form involving the 
quantities at the actual cutoff instant. This is easily done, because up 
to the first-order quantities, according to Eq. (13.35), 


(Ir) eat, = (Pia mc) ba et 


(Brent, = (in — Pints — Grint Dha 


or 


Similarly, 
(80) = (Pray — (Peni -(3 is) _ by 
test 


where F, Ĝ, and Ë are the values of these quantities, given by Eq. (13.14), 
evaluated on the normal flight path. In fact they should be evaluated 
at an instant just before the normal cutoff time į so that the accelerating 
force of the rocket is included and the rates of change of velocities are 
those of a powered flight. Now define J and J as follows: 


= [Nir + AZO + AB + Ako, + Név + BB], (18.38) 
and 
J = [nie + ARG +E + NSD, A N + Be, 
where Af are the values of ); evaluated at the normal cutoff time &,. 
Then the condition to be satisfied at the actual cutoff instant t is 


CONTROL DESIGN BY PERTURBATION THEORY 189 


J=ĴJ+ Lat, + Wo + r¥6 + ATF + FG + val _ (44 —t) (18.39) 
This is the equation for determining the proper instant of power cutoff. 

When the normal flight path is known, J and the quantity within 
the bracket to the right in Eq. (13.39) are fixed. Then the whole right- 
hand side of Eq. (13.39) can be considered as a linearly increasing func- 
tion of time t if ¢ is substituted for ¢,. Simultaneously J can be com- 
puted at every instant before cutoff by using the predetermined \* and 
the values of position and velocity of the actual vehicle obtained by 
tracking stations. The magnitudes of the quantities on the two sides of 
Eq. (18.39) can then be continuously compared. When they are equal 
to each other, Eq. (13.39) is satisfied. Then the power cutoff signal is 
given, and the rocket power is shut off. 

13.6 Guidance Condition. When the rocket power is shut off earlier 
or later than the normal cutoff instant t, the propellant left in the tank, 
if not dumped, will alter the weight W and the moment of inertia J 
of the vehicle. It is also possible that the pay-load of the vehicle is 
not that specified for the standard vehicle. Then, after power cutoff, 
there is a fixed ôW and ôI, fixed in the sense that they do not change 
with time and are known once the power cutoff is effected. Of a different 
character are the deviations 6p, ôT, and ôw of the actual atmosphere 
from the standard atmosphere. These are not known unless they are 
measured. In the following, it is proposed to use the vehicle itself as a 
measuring instrument, and we proceed as follows. 

After the cutoff condition is satisfied, the condition for zero range 
error is that the integral in Eq. (13.35) should vanish. Now since the 
Y; in that integrand involve arbitrary disturbances 6p, ôT, and ôw not 
known beforehand, this condition can be satisfied only if the integrand 
itself vanishes. That is, according to Eqs. (18.28), 


(Msas + Asbs + recs) ôy + uas + Abe + Asco) Sp 
+ (Mar + sbr + eC) ôT + (Asas + sbs + A6Cs) dw 
-F (Aado + A509 + eC) OW + A6C10 0l = 0 


or, with the following notation 


ds = gs + Asds + Vets 

de = Mas + Asde + Asls 

dy = Asar + Azby + AgC7 (18.40) 
ds Asas + Asbg + A6Cs 

D — (Mlo + hsb + eC) OW = N6Ci0 ôl 


this condition can be written as 
ds oy + de dp + dy ôT ae ds bw = D (13.41) 
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Equation (13.19) can be rewritten as 


as ôy + as 6p + ar ôT + as dw = A 
bs ôy + be ôo + by ôT + bs ôw = B (13.42) 
Cs by + cs p + cy T + cs ôw = C 


where 
= AO L ay êr — an 88 — a5 60, — 04 B09 — as SW | 
= om — bi ôr — ba 68 — b; dv, — ba bug — by BW (13.48) 
= 18 L o ar — or 58 — os Bur — ca 500 — cs BW — ew ôT | 


If the tracking stations for the vehicle will measure the quantities A, 
B, and C, then the atmospheric disturbances 6p, ôT, and ôw can be 
determined by solving for these variations using Eq. (13.42). This is 
essentially using the vehicle itself as a measuring instrument for dp, ôT, 
and ôw. When 6p, ôT, and ôw are known, Eq. (13.41) gives the proper 
elevator angle correction ôy. 

A mathematically equivalent way to calculate ôy would be to solve 
directly for ôy using the system of Eqs. (18.41) and (13.42). Thus 


Qs Ge ar as A dg Gr Gs 
bs be by bg = B be by bs 
Cs Ca Cy Cg S C Ce Cr Cs (13.44) 
ds de dr ds D ds dr dg 


This equation specifies the necessary change in the elevator angle at 
every instant, to be calculated from a’s, b’s, c’s, and A, B, C, D at the 
same instant. These quantities consist partly of predetermined infor- 
mation from the normal flight path, and partly of measured information 
on the position and the velocities of vehicle obtained by tracking the 
vehicle. At high altitudes where the air density is very small, the aero- 
dynamic forces will be almost negligible in comparison with the gravita- 
tional and inertial forces. Then the quantities A, B, and C of Eq. (13.48) 
will be the small difference of large magnitudes. : These are then the 
quantities most difficult to determine accurately. If the actual ele- 
vator angle is made to conform with the one calculated by Eq. (13.44), 
then in conjunction with the proper power cutoff, as specified in the last 
section, the vehicle will be navigated to the chosen landing point in spite 
of the atmospheric disturbances. 

13.7 Guidance System. When the general character of the flight 
path has been chosen from over-all engineering considerations, the first 
step is the calculation of the normal flight path using the properties of 
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the standard atmosphere and the expected performance of the vehicle 
with normal weight. The knowledge of the normal flight path then 
determines the a’s, b’s, and c’s. Equation (13.27) together with the end 
conditions of Eq. (13.33) allows the calculation of the adjoint functions 
\;. All this information should be on hand before the actual flight of the 
vehicle and may be called the “stored data.” 

Before the power cutoff, the elevator angle may be programed accord- 
ing to that for the normal flight path, and the stability of the vehicle 
is supplied by the jet vanes or by the auxiliary rockets. The tracking 
stations, however, go immediately into action and supply the vehicle with 
information on its positions and velocities. This information goes first 
into the cutoff computer which, using the stored information, continu- 
ously compares the magnitudes of quantities on the two sides of Eq. 
(13.39), the cutoff condition. When that condition is satisfied, the power 
cutoff is effected. 

At the instant of power cutoff, the tracking information is switched 
to the computer for the guidance system. The instant of power cutoff 
also fixes the amount of propellant in the tank and thus determines the 
variations of the weight W and the moment of inertia J from the normal. 
This information together with the stored data on the normal flight path 
then allows the computer to generate the elevator correction angle dy 
according to Eqs. (13.40), (18.48), and (13.44). Theoretically the value 
of ôy must be obtained without time delay from the instant when the 
information is received, because Eq. (13.44) 1s a condition of equality 
of two quantities evaluated at identical time instants. The computed 
correction ôy combined with the elevator angle 7, determined for the 
normal flight path, then gives the actual elevator angle setting y. The 
design of the control mechanism for the elevator from here on can follow 
the practice of the conventional feedback servomechanism, with the 
usual criteria of quick action, stability, and accuracy. The general 
scheme of the guidance system can then be represented by Fig. 13.8. 

The computers envisaged here are carried in the vehicle and receive 
the information on positions and velocities of the vehicle from the fixed 
ground tracking stations along the flight path. Then, as indicated in 
Fig. 13.3, this is the feedback link. Properly designed computers here 
assure us of the specified performance of the system and thus function 
as the compensating circuit or the amplifier of the conventional servo- 
mechanism. In over-all conception then, our guidance system is very 
similar to the simple servomechanism studied in the preceding chapters. 
However, the guidance system is a system of great complexity. Its 
design requires the theory of perturbation together with the concept of 
adjoint functions. Our example of the guidance of a long-range rocket, 
although somewhat oversimplified, serves the purpose of showing how 
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the theory can be used to design the control system. In the example, 
there is only one design criterion, i.e., the vanishing of range error. In 
more complicated systems, more than one design criterion can be speci- 
fied, and more than one set of adjoint functions will be required. Never- 
theless, the design principle will still be the same as that shown in the 
simple example. 


Tracking Information 


Cut - off Cut-off Signal 
Computer cas 


. 


Stored Data 


Elevator Servomechanism 


Tracking 


Guidance 
Computer 


Atmospheric 
Disturbances 


instrument 


AE ED RS m RE ND Sei ae AOD GRR e a aeey OUI O SD ewe a que ARIE 


Tracking Information 
Fic. 13.3 


13.8 Control Computers. Although it is not the purpose here to dis- 
cuss the detailed construction and the engineering of any components 
of the systems considered, the role of the computer, first introduced in 
connection with optimum switching in Chap. 10, is so important in the 
more advanced control systems that a general discussion of its character- 
istics and requirements would be appropriate. For details, the reader 
should refer to books on this special subject.! 

There are in current use two different kinds of computers: the analog 
computer and the digital computer. The analog computer is just what 
its name implies: a physical analogy to the type of problem its designer 
wishes to solve. It is then a system which is described by the same 
mathematical formalism as the computation to be carried out. The 


1 Engineering Research Associates, “High-speed Computing Devices,” McGraw- 
Hill Book Company, Inc., 1950. For d-c analog computers, see G. A. Korn and T. M. 
Korn, “Electronic Analog Computers,” McGraw-Hill Book Company, Inc., New 
York, 1952. 
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inputs to the machine are in terms of the value of some physical quantity 
—an electric voltage or current, the degree of angular rotation of a 
shaft, or the amount of compression of a spring. The machine trans- 
forms these inputs into other physical quantities, the outputs, according 
to the rules of its construction, chosen by the designer to represent the 
specified mathematical procedure. The inputs to an analog computer 
are then the instrument readings of the several physical quantities of 
the system to be controlled, and the outputs of the computer are com- 
mand signals fed directly to the individual servomechanisms of the 
controlled quantities. 

In contrast to the analog computer, a digital computer works by 
counting. Data on the problem must be supplied in the form of num- 
bers; the machine processes this information according to the rules of 
arithmetic or other formal logic demanded by the computing problem, 
and expresses the final result in numerical form. There are two very 
important consequences of this manner of operation. First, input and 
output equipment, or the “transducers,” must be designed to make an 
appropriate translation between the logical world of the digital machine 
and the physical world of the controlled system. Second, the problem 
to be solved must be formulated explicitly for the digital computer. In 
the case of the analog computer, the problem is implicit in the construc- 
tion of the machine itself; construction of a digital computer is deter- 
mined not by any particular problem or class of problems but by the 
logical rules which the machine must follow in the solution of the particu- 
lar computing problem. 

In comparing digital and analog computers as components in a control 
system, we observe, first, that for simple control applications the analog 
machine is almost always less elaborate than a digital machine would be. 
Even the most elementary digital computer requires an arithmetical 
unit, a storage unit, a control unit, and input and output transducers. 
For simple problems, this array of equipment is wastefully elaborate. In 
contrast, an analog computer need be no more complicated than the 
problem demands. In fact, as mentioned before, the compensating 
circuit in a simple servomechanism is such an analog computer. 

As the computing task becomes more complex, as for the guidance 
problem discussed in this chapter, the analog machine loses its advantage, 
and we see a second fundamental difference between the two types of 
machine. The analog computer, being a physical analogy to the problem, 
must be more complicated for more complicated computing problems. 
If it is mechanical, longer and ever longer trains of gears, ball-and-disk 
integrators, and other devices must be connected together; if it is electric, 
more and more amplifiers must be cascaded. In the mechanical case, 
the inevitable looseness in the gears and linkages, though tolerable in 
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simple setups, will eventually add up to the point where the total back- 
lash or “play” in the machine is bigger than the significant output 
quantities, and the device becomes useless. In the electric case, the 
random electric disturbances, the noise, which always occur in electric 
circuits, will similarly build up until they overwhelm the desired signals. 
Since noise is far less obtrusive than backlash, electric analog computers 
can be more complicated than their mechanical counterparts, but still 
there isa limit. The digital computer, on the other hand, is entirely free 
of the hazards of backlash and noise. There is no intrinsic limit to the 
complexity of the problem that a digital machine can handle. 

The third important difference between analog and digital computers 
is in their potential accuracy. The precision of the analog computer 
is restricted by the accuracy with which physical quantities can be 
handled and measured and by the accuracy of representing a physical 
system by idealized laws. In practice, the best such a machine can 
achieve is an accuracy of about 1 part in 10,000; many give results 
accurate to only 1 or 2 parts in 100. For some applications this range 
of precision is adequate; for others it is not. On the other hand, a 
digital computer, which deals only with numbers, can be as precise 
as we wish to make it. To increase accuracy we need only increase the 
number of significant figures carried by the machine to represent each 
quantity being handled. Of course, the over-all accuracy is still limited 
by the accuracy of the input and output transducers; but this does not 
alter the fact that where high precision is required, a digital computer is 
preferable to the analog computer. 

There is a fourth aspect in which the two types of computers differ. 
An analog machine works in what is called real time. That is, it continu- 
ously offers a solution of the problem it is solving, and this solution is 
appropriate at every instant to all the inputs which have so far entered 
the machine. On the other hand, a digital machine works by formulat- 
ing and solving an explicit logical model of the computing problem. 
Therefore a digital machine gives only a sequence of values of the output 
at discrete time instants, and, furthermore, because of the finite, although 
short, computing time, the output lags behind the input. Two problems 
then arise: the problem of interpolation of the output between the dis- 
crete time instants, and the problem of prediction of the output from 
fast values to remove the time lag of the output. Obviously, if the 
computing time is very much shorter than the characteristic time of the 
controlled system, the prediction question can be ignored, and the com- 
puter considered to be working in real time. The present-day electronic 
digital computer seems to be fast enough in this respect for the guidance 
problems of the long-range rocket discussed previously; but for a high- 
speed guided missile, this time-lag effect must be properly included in 
the design of the control system. 
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Appendix to Chapter 13 


CALCULATION OF PERTURBATION COEFFICIENTS 
The quantities F, G, and H are defined by Eq. (13.14). They contain the param- 
eters =, A, A, and N. According to their definition, as given by Eqs. (18.2) and 
(13.13), they can be written as follows: 


s= 
E wW 
g 1l ee 
A = 55 pPACL v + (ug + w)? 
i (18.45) 
A= Wy 9 PACD V ve + (vg + w)? 
N = 75 pACul0t + (g + w) 


where the aerodynamic coefficients Cz, Cp, and Cy are functions of the angle of attack 
a, the elevator angle y, the Mach number M, and the Reynolds number Re. These 
aerodynamic parameters are related to quantities immediately connected with the 
flight path as follows: 
= 8 — =i ut = Vv = pyt 

a = 6 — tan (~ a -) M = i Re = i (13.46) 
where a(r) is the sound velocity in the atmosphere, and u(r) is the coefficient of viscos- 
ity of air, both functions of altitude r. In the following calculation, the thrust S will 
be considered to be a function of altitude only. It is also assumed that the composi- 
tion of atmosphere at different altitudes remains the same as that of the standard 
atmosphere; only the density p and the temperature T changes. Thus the variations 
of a and u at any altitude are variations due to temperature T. 


For 2: 
2 gös 92 __ = 
or Wor ow W rel 
All other partial derivatives are zero. 
Paes 
=a fe? (1+ 3252) + tar leo ao ot + 1) 
p ar Cz dRe V2 dr Cr âM Cr ôRe 
L 80a, | — M oz Lda _ Be aC 1da) 
REO E oe » | Mar Crone par 

OA ri (Geom ðC 4. Re Cy 4+ 1 pest) 
Ov; V2 \CraM ` CraRe Cr da v 
ðA _ us Hw (aCe, Redes y -lC Vy ) 
ave y? C ðM Cr dRe Cr da vetw 
aA a 1 aC | 
op Cr da 
ah, 1 Cy (13.48) 
ov ðY 
oA Re dC rz 
ap a= (4 to Cz T) 
h a a (MiGs 1 4 ReaGr tan) 
ôT CraM 2T ' CzrôReuôT 
ðA ve t+w(/MaCrt _ RedCy l aC, ùr _ oA 
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196 ENGINEERING CYBERNETICS 


For A, the partial derivatives are obtained from the above equations by substituting 
A for A, and Cp for Cz. 
For N: 


aN _ ldp Re dC xy 1 dwf / M Cu , Re Cu 
oo ea ( m tae) t Vide (Ge, aM | Cy Re 2) 
1 dC |- M dCylda Re ig) 


toa | CaM adr Rent 


aN _ AG ICar | Re aC 2 7 ue) 
yi Cu 6M Cu dRe Cu da Vy 
ƏN ,,vetw/M Cy , Re dCu 1 Cm v 
ðv y? G aM t GyaRe T2 T Gy Oa da ——) 
aN 1 Cx 
ôb Cu ða 
aN ‘1 aCy (13.49) 
ðy Cu dy 
aN _ Re Cu 
ap u (1 T Oy Cu ES) 
aN ey |; (=F ôu 1 Re Cy 1 ĉa) 
oT Cu OM 2T Cis aRe par 
aN | aN 
dw due 
an _N 
al I 


With these partial derivatives, the coefficient a’s, b’s, and c’s can be easily calculated: 


_OF a2. dw dA ðA , [ve : 
a =F = Gsm Bt EA + Oo tu) St noe + (2 4 a) 
va (ve g {ro 2 
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ee ahr) 
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ôG az ðA dw ðA — vwo 
ep a a ee 
aG ; OA OA 
bg = gg = Zain 8 — v gg T ve tw) ae 
aG ðA A 1 ve 
bs ge ee Jay, 7? TET) 
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After the power cutoff, the thrust S vanishes. 
tives are zero. 


Thus for t > f, Z and its deriva- 


CHAPTER 14 


CONTROL DESIGN WITH SPECIFIED CRITERIA 


In the earlier chapters, we discussed the design problem of control 
systems mainly from the point of view of analysis. That is, assume 
the construction of the system, and then find out whether the performance 
of the system is satisfactory. In the last chapter, we introduced for the 
first time a different and more direct point of view: we specify the per- 
formance first, and then find out the necessary control system which 
will give the desired performance. In this chapter, we shall extend this 
principle to arbitrarily controlled systems, for which the performance 
criteria are expressed in terms of integrals of the controlled variables. 
The resultant system behavior is represented by a very general equation, 
which is usually nonlinear. Thus the control system so designed is 
generally a nonlinear system. The nonlinearity here, however, is pur- 
posefully chosen to give the optimum performance of the over-all system. 

Mathematically, we can describe the principle of control design with 
specified criteria as follows. In the system to be controlled, we intro- 
duce one or more extra variables. These extra variables, being arti- 
ficially created, are not determined by the intrinsic physical laws of 
the controlled system. We obtain the conditions for determining the 
extra variables by satisfying the specified criteria of the over-all per- 
formance. These conditions are then enforced through the computer 
built into the system. ‘This principle of control design was first suggested 
by Boksenbom and Hood.! The following discussion follows partly the 
cited work of these authors. 

14.1 Control Criteria. If y is the output of the controlled system, 
then it is reasonable to expect that the measure of over-all performance 
of the system is expressed as the time integral of some function f of y. 
Then the criterion of the performance is that this integral is to be mini- 
mum or a constant; that is, 


is f(y) dt = const. or min. (14.1) 
or, specifically, 
k (y — ys)? dt = const. or min. (14.2) 


1A. S. Boksenbom and R. Hood, NACA TR 1068 (1952). 
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where ¢; is the time at the end of transient and y, is the setting or the 
desired value of y. Equation (14.2) weights the error in y as the square 
and according to the time duration of that error, and is thus a measure 
of the mean-square error from the setting. Another type of criterion 
may be that which requires a criterion time duration to be a minimum 
or a constant; that is, 


Í “ dt = const. or min. (14.3) 


The use of a single criterion, such as Eq. (14.1), will yield f(y) = con- 
stant. This result is reasonable because f(y) can usually be made to be a 
constant if no additional criteria are imposed on other variables in the 
system. Usually, however, certain limiting conditions exist on other 
variables in the system, and these conditions must be included in the 
original criterion. Thus, for example, a possible criterion could be writ- 
ten as follows: 


ti A ; ) 
i (y — y)? dé = min. | 


j (14.4) 
for ; f(z) dt = const. 


If, for instance, y is the engine speed and z is the characteristic tempera- 
ture of a gas turbine engine, the criterion of (14.4) states that it is desir- 
able to design a control system such that, for a particular value_of a 
temperature integral, the integral of the speed-error squared is a mini- 
mum. This criterion may be used if, for instance, it is known that an 
overtemperature condition can be tolerated for a certain period of time 
and it is desired to keep the average speed error at a minimum during 
this transient. Then the integral of z represents the total heat input to 
the turbine blades. 

The general theory will show that as many criteria as desired of the 
type shown in Eqs. (14.1) to (14.4) can be included together, and a con- 
trol system can be derived that automatically satisfies all these criteria 
simultaneously. 

Another aspect of the control criteria is the end conditions of the 
integrals of Eqs. (14.1) to (14.4). The time interval for which these 
integrals are to be a minimum or a constant must be chosen. A reason- 
able time interval is any duration during which essential external dis- 
turbances are constant and during which the system to be controlled 
moves from one essential level of operation to another. The essential 
external disturbances are those that cannot be immediately corrected 
by the control system. If an essential external disturbance occurs in the 
time interval of the criteria, no physically realizable system could be 
designed to anticipate this disturbance so as to behave properly before 
the disturbance takes place. An essential level of operation is any 
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specific condition of only those variables that must be continuous. 
In the case of a turbojet engine, the transient behavior of which can be 
described by a first-order differential equation, the engine speed deter- 
mines the level of operation. If the fuel system must be considered 
or if the temperature does not respond to speed immediately, then both 
the engine and the acceleration are required to describe the essential 
level of the engine. We shall see this presently. 

The control system resulting from any design method must be phys- 
ically realizable. There are two aspects to this problem. First, it is 
possible to set down criteria that are not realizable with any system 
or are incompatible with each other. If such criteria are used, the 
unrealizability will appear either as a requirement on the control to 
look ahead into the future or as an inability to satisfy the boundary 
conditions of some differential equation. In most cases, a clear under- 
standing of the criteria used and of the system to be controlled will 
preclude incompatibilities of this sort. 

The second aspect of physical realizability is purely mathematical. 
It is desired to derive a description (a differential equation) of the control 
or the controlled system that satisfies the criteria of control and all the 
necessary boundary conditions that arise in the derivation of this equa- 
tion. Although the mathematical solution of the problem may be any 
derivative or integral of this differential equation, the physical solution 
of the problem requires the differential equation that itself satisfies the 
boundary conditions and for which no undetermined constants of integra- 
tion exist. Thus, such forms as 

Y= cz 
and : 
y = Cx 
are not necessarily interchangeable as descriptions of some part of a 
controlled system, because the forms differ by an undetermined con- 
stant of integration. For stable linear systems, the effect of this constant 
becomes vanishingly small; for the general nonlinear systems presented 
here, however, this constant must be considered. 

14.2 Stability Problem. The requirement of stability is a special 
criterion that does not enter into the design of the main control system 
during the transient. This situation is the same as that of the last 
chapter where the stability criterion is also suppressed, because the 
satisfactory performance of the over-all system is already made certain 
by the imposed performance specifications. However, it is usually 
necessary to add to the controlled system a stability device that does 
not go into action until the final instant of the transient interval. There- 
fore, this stability device will not affect the behavior of the system as 
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far as satisfying the other criteria is concerned. This device can be 
described as follows: For a first-order system, when 


y = Ys 
then j=0 | (14.5) 


For second-order systems, when 


Y= Ys 


then y=0 and ğ = i (14.6) 


When such a stability device is added to the control system, the con- 
trol system has two modes of operation and is thus a multiple-mode 
system (cf. Sec. 10.9). During the transient, the main control system 
is in operation to produce the specified performance. At the end of 
transient, the control is switched to a second system, represented by 
Eq. (14.5) or Eq. (14.6), to ensure stability of the system at the end 
state and thus to avoid running away from the desired operating point of 
the system. 

14.3 General Theory for First-order Systems. With the type of 
control criteria given previously by Eq. (14.1) to (14.4), we can formu- 
late the control equation in the following manner: if, for such a list of 
criteria, one of the integrals is to be a minimum under the condition that 
the other integrals are to be constant, it is sufficient, according to varia- 
tional calculus, to make 


EFU) at +s f(y — yot dt + da f fale) dt + rs [dt = min. 


or 


[EUO + ray — 9) + dafole) + Aad dt = min, (14.7) 


The X's are arbitrary constants that enter into the control system as the 
adjustable parameters and are precisely determined by the choice of 
values that the constant integrals are to have. The technique of the à 
multipliers is widely used for problems of this type, where one condition 
is to be a minimum under other restrictive conditions. Indeed, the con- 
ditions need not be in integral form, and any functional or differential 
relation among variables can be handled in a similar manner.! Equa- 
tion (14.7) can be made very general when all possible restrictive condi- 
tions are included. In the final equations, if any one criterion is not to 
be used, then the corresponding à — 0. If any of the criteria are to be 
zero, then the corresponding \—> ©. 


1 See for instance C. Lanczos, ‘The Variational Principles of Mechanics,” Univer- 
sity of Toronto Press, Toronto, 1946. 
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If the system to be controlled is of jirst order with constant coefficients 
and with one essential output y, then the variables y and z must be 
related so that z = 2(y,y). Equation (14.7) can then be written, in 
general, as 


f * Ply ý) di = min. (14.8) 


where F is a continuous function of y and y, and y is a continuous func- 
tion of time t We note that F is not explicitly dependent on the time t. 

Let us consider y(t) to be a solution, that is, y(t) is the output among 
all admissible outputs which satisfies the condition of Eq. (14.8). To 


òy (t) 
ERES A 


Ceai 


y=f(t) 


y(t) êt 
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test this, we construct a neighboring solution as y(t) + e dy(t). y(t) is 
an arbitrary function, and e is a small parameter. If the condition of 
Eq. (14.8) is satisfied by y(t), then 


Ezz 


d tı-+e dt1 E 
¢ i Fly + e dy,y + e dy) | = 0) 
E€ jO 0 


ii ¿i 
o y o oY 

The variation ôi, occurs because of the fact that the end point of the 
integral of Eq..(14.8) is not fixed, but lies on the curve y = f(£), as shown 
in Fig. 14.1. This is to allow the proper boundary conditions of moving 
from one essential level of operation to another, as previously discussed. 
By partial integration, Eq. (14.9) becomes 


"| aF d {oF oF oF 
! E ~ a (=) | Ay Gear (Fr). bylt) — gy VO + Fl) bh = 0 


The relationship between ôy(t1) and ôt, can be easily computed from the 
end condition. That is, 


ý bt + byl) = F(t) bts 
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Then by eliminating ôy(tı), and since ôt is arbitrary, we have 


i Í ES £ @ )| sy di = 0 (14.10) 
bty [re F al [re — 9 a} - 2) ôy(0) = 0 (14.11) 


The time interval during which the criterion of Eq. (14.8) is to hold 
is considered as that during which the system moves from one essential 
operating level to another; in this case, from one definite value of y to 
another definite value of y. Thus the end curve y = f(d) must be a 
straight line with f(t) = constant. Hence 


and 


òy(0) = 0 | 
Thus Eqs. (14.10) and (14.11) become 
OF ad foF 
ay a (2) (14.13) 
and 
F(t,) = y(ts) (a if (2) is finite (14.14) 
Oy dy jo 


Equation (14.13) need not hold at ¢ = 0 because ôy(0) = 0. The only 
conditions that need hold at ¢ = 0 are that (3F/ðý)o is finite and y is 
continuous. At the start of a new transient, y, F, (@F/dy), and (oF /dy) 
may be discontinuous, whereas at other points (0 < ¢ < #,), dF /oy will 
be continuous because of Eq. (14.18). 

Equation (14.18) is the differential equation for the y(t) that satisfies 
the original criterion of Eq. (14.8). This equation is the so-called Euler- 
Lagrange differential equation of our variational problem. For the prob- 
lem considered here, F does not explicitly depend upon t. Then we can 
immediately obtain a first integral of this equation. The first integral 
of Eq. (14.13), which satisfies the boundary condition of Eq. (14.14), is 


oF 


Pyy) = y E7 (14.15) 
By differentiating this equation with respect to ¢, we have 
oF. , . oF d {oF 
ito 55 t 8a a) 


or whenever y, 0F'/dy, and so forth are continuous 


EE 
Yl ay diag) | ~ 
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Thus either y = 0 or Eq. (14.18) is satisfied. However, y does not 
generally vanish during the transient. Thus the two conditions on y(¢) 
as specified by Eqs. (14.13) and (14.14) are now replaced by the single 
equation (14.15). 

Thus Eq. (14.15) is the description of that physically realizable system 
the behavior of which will automatically and simultaneously satisfy those 
criteria included in the function F during that time interval for which 
the external disturbances are constant and during which the system 
goes from one operating level to any other operating level. At the end 
of the transient when the end level of y is reached, a stability device 
must be added to the system; the description of such an ideal device is 
that of Eq. (14.5). 

14.4 Application to Turbojet Controls. In the usual case of designing 
turbojet engine controls, the engine speed N that sets the essential oper- 
ating level of the engine is to be set or controlled. As the result, other 
pertinent characteristics, such as thrust, are also set. Limiting condi- 
tions of the engine are those of overspeed, overtemperature, compressor 
surge, and rich burner blowout. Let N, be the engine speed setting, T 
be the inlet temperature to the turbine, and P be the discharge pressure 
of the compressor; then the criteria on the behavior of this engine can be 
specified as the following integrals: 


% fi (N — N,) di for speed control 


en rere ten ee eel 


i Fo(N) dt for speed overshoot 
k f:(T) dt for temperature overshoot and 
undershoot (14.16) 


i falP — g(N)] dt for compressor surge 
. fsiP — h(N)] dt for blowout 
and i “di for rise time 


The nature of these functions is sketched in Fig. 14.2. The quantity 
P — g(N) is the amount by which the compressor discharge pressure 
exceeds the safe pressure for surge, and g(N) is the compressor discharge 
pressure for each engine speed at a safe value below surge. Rich burner 
blowout can be handled in a similar manner. The rise time is the total 
time for the system to move from one essential operating level to the other. 

Similarly to the treatment on turbojet behavior in Sec. 5.6, the linear- 
ized engine characteristics can be expressed as follows: 


T = aN tarN 


P =bN +c? (14.17) 
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where r is the engine time constant. By substituting these relations 
into the integrals of Eq. (14.16), we see that they all take the form 


I “AN N) dt 


where f is a continuous function of N and N, and N is a continuous 
function of t. 


f,(N-Ng) ha N) 


N-N; Nmax 


f(T) f,[P-g(N)] 
P—g(N 
fsLP—h(N)] a(N) > 
O P-h 


ae 14.2 


14.5 Speed Control with Temperature-limiting Criteria. If only the 
error in speed control is considered important, the criterion becomes 


IN — N.) di = 


Then the control condition of Eq. (14.15) simply gives 
AN-N) = 0 
Then because of the nature of the function fi, N = N.. This result 


means that, in the absence of other criteria on the engine behavior, 
this speed control should keep the speed error identically zero, which is 
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physically possible only in the sense of allowing infinite temperatures. 
This result, however, is inconsistent with the previous development of 
Eq. (14.15), in that N is not a discontinuous function of time. This 
instance is actually a trivial case of the general problem. But the result 
does indicate that a criterion like this must be accompanied by an 
additional criterion to give a physically sensible system. 

Now if the error in speed control is to be combined with the condition 
on the overshoot and undershoot of temperature, then 


JEAN — N) + ND] dt = min, Gere) 


Therefore F = f,(N — N) + Af2(T). By using Eq. (14.17), Eq. (14.15) 
becomes 

JN — N.) + Af(T) = dAarNf;(T) (14.19) 
This is the control equation during the transient. At the end of the 
transient, the ideal stability device is switched on, so that when 


N=N, 
then N=0 


Equations (14.19) and (14.20) describe the complete control system. 
Therefore we can visualize a computer so designed that it takes informa- 
tion from the measurements on N and T, the stored information on \, a, 
and 7, and the relation between the fuel rate and N and T, and gener- 
ates the signal for the proper fuel rate in accordance with Eq. (14.19). 
Then shortly before N reaches N., the stability device takes over, so 
that at the end of the transient, Eq. (14.20) is satisfied. In general, the 
control equation (14.19) is nonlinear, and the computer cannot be a 
linear device, such as a simple RC circuit. 

As an example it is convenient to let f3(f) = (T — Le)" for T > L; 
and f3(T) = (Lı — T) for T < Li. In general, the power n should be 
>1, because when n < 1, T may be infinite and of such nature as to make 
N discontinuous and physically unreal, even though the integral 


, f(T) di 


is finite. In the example of this discussion, let n = 2, and, furthermore, 
JN — N.) = (N — N.. Therefore we again take the mean-square 
deviation from the setting as a measure of the error. Then Eq. (14.19) 
becomes 


(14.20) 


= 2 ‘ 
= + (L — aN)? = ær N? (14.21) 


where, for acceleration, or when N < N,, 
N>O and L=L, (14.22) 
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For deceleration, or N > N,, 
N<0 and L= l (14.23) 


The block diagram for the control system is shown in Fig. 14.3. Ne 
is the actual engine speed. We assume the case of deceleration with 
N > N.. During the transient, Ne — N, is positive, the switch between 
the computer and the engine servomechanism is closed, and the signal 
from the computer is in action. The computer generates the signal 
arN according to the control Eq. (14.21). In Fig. 14.3, the signal is 
schematically indicated by a rectangular triangle. The engine servo- 
mechanism is so designed that the signal arN from the computer is 
actually obeyed closely by the engine. This is done by using a high- 
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gain circuit as indicated. When the speed error (NV. — N.) is reduced 
to a very small value, the computer signal is switched off. Then the 
stability of the engine servomechanism will guarantee the stability of 
the system at the engine setting N., and the system essentially satisfies the 
condition of Eq. (14.20). 

The control system has one adjustable parameter A. For any value 
of à, this system will, for the value of integral temperature overshoot 
obtained, give the minimum value of the integral speed error squared. 
The value of A determines the actual value of the integral temperature 
overshoot. Let us consider the special case where aN, = L; that is, 
acceleration or deceleration to the speed that corresponds to the limiting 
temperatures occurs, according to Eq. (14.17). It is interesting to note 
that for this special case, NV = 0 when N = N,, according to the control 
condition of Eq. (14.21). Therefore no separate stability device is neces- 
sary, and the switch in the control system as shown by Fig. 14.3 can be 
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eliminated: In this special case, Eq. (14.21) becomes linear and can be 
written as 

E(L — aN) = arN (14.24) 
where 


3 
ae (1 ra 5) (14.25) 


Now the integrals can be easily computed. For instance, the temperature 
integral is 


ti ty ; 
l (T —L) dt = I (aN — L + arN)? dt 
0 0 


ti 
(os » | (L — aN)? dt 
0 


a(t — yf (Nam nae 


aX(E — yf (N, — m 
No 
WN pN a 


a'r(E — 1)? 


No Ea(N, =F N) 
= oy EOS ww. — way? A - a? 
Thus 
1/" T-L} , (E-1)? 
where Na is the engine speed at the beginning of the transient. Similarly, 
the speed integral is 
N—N, 1 
z 0 Be: — aN 3) at = oR ween 
and if Tmax is the maximum temperature, then 
SEENA Eg L S 
oan E-1 (14.28) 
From Eq. (14.24), we have 
Ba(N, — N) = ar 


so that the characteristic time 7* for the controlled transient is 


ras (14.29) 


The left sides of these equations have been put in dimensionless form. 
The maximum temperature Tmax occurs at the beginning of the transient. 
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For E = 1 (A = œ), the temperature does not overshoot, in agree- 
ment with our previous statement that the constant integral has the 
value zero when \—> ©. ‘The'speed integral is 0.5, andr* =r. As E 
increases (or à decreases), the temperature integral and the maximum 
temperature increase, whereas the speed integral and the time constant 
decrease. A compromise value for E may be 1/2, or as = 1. Once 
the choice of E or d is settled, the specification for the control computer 
is fixed. The design can then proceed. 

For the general case of Eq. (14.21), the calculation of the values of the 
integrals is somewhat more cumbersome, but a similar procedure applies 
for the design of the control system. Boksenbom and Hood actually 
give solutions of Eq. (14.26), that is, N as a function of time ¢. But 
we should emphasize here that such explicit solutions are not necessary 
for the control design. The information for control design is fully set 
forth by Eq. (14.24) itself, because that equation tells how the control 
computer should be constructed. If the control computer is made 
according to that condition, then the desired performance is ensured. 
The actual variation of N with respect to time is thus of no importance. 
Hence our approach to the design problem is to “design” the nonlinear 
control equation itself rather than to design according to the solutions 
of an assumed equation. 

14.6 Second-order Systems with Two Degrees of Freedom. For the 
case of a second-order system with two degrees of freedom and with 
constant coefficients, Eq. (14.8) becomes 


f Pudi) dt = min. (14.30) 
where y and z are the outputs and are independent functions of time. 
The condition to satisfy Eq. (14.30) is 


d tite õi 


de Fly + e dy,y + eý, ý + € 69, 
E€ JO 


z + e 62,2 + e 62,2 + € 62) dt = 0 
ate = 0 


(14.31) 


The time interval of the integral of Eq. (14.30) begins at a definite time 
(t = 0) but does not end at any definite time, but rather along the curves 
y = fild), ý = fo(t), 2 = gilt), and 2 = g(t). The functions dy and ôz 
are arbitrary and, naturally, independent functions of time. 

Performing the operation indicated by Eq. (14.31) gives 


“| oF oF... OF... OF oF .. , oF | 
+ F(t) ôt = 0 
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After integration by parts, we have 
ala d Gi) d? ee 
h x-a ay) aeg] YA 
oF d ze 
ty 
u HI +5 46 z) u| + F(t) ôt 


d {oF B 
ae HA be + E og sea Nal = 0 (14.32) 


As before, the integrands of the integrals and the boundary-condition 
terms must vanish separately. From the specified end condition, we 
have 
òy (i) = [fi(6) — y)] ots 
by(ts) = [fo(t) — G) ot 
14.33 
selt) = [gi(ts) — a(h)] Of cee 
b2(h) = [go(ts) — 2(ta)] ots 


The three conditions from Eq. (14.32) are the two simultaneous Euler- 
Lagrange equations 


EOR 
Y Y (14.34) 
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3 oF afar ny (OF 
+ gti) (z ~ L i ôtı + dy(0) E ay E E + 6y(0) (2) 


+ 82(0) Ë A -45 E oR = 0 (14.35) 


Equation (14.34) is a system of two equations that satisfies the original 
criterion of Eq. (14.30). The physical solution to the problem must 
satisfy Eq. (14.34) and, in addition, satisfy the boundary-condition equa- 
tions of Eq. (14.35). However since F does not explicitly depend upon ¢, 
a modification of this set of conditions is, possible: If the first of Eqs. 
(14.34) is multiplied by ġ and the second by 2 and the equations are 
added, an exact derivative is formed the integral of which is as follows: 
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F — Epaia E) - ETE z tê (E) -£ =c (14.36) 


Since F is a function of y and ġ, it is unlikely that dF /dy and ðF/ðğ can 
be zero, or that ôF/ðğ can be a constant with respect to time. Then 


of oA L is not zero. Therefore it is reasonable to have 
dy  dt\ðğ 


ôy(0) = 6y(0) = fii) = fa) = 0 


in Hq. (14.35). A similar argument applies to the variable z. Thus a 
reasonable set of boundary conditions is as follows: 


ne le foe = 0 
- 0 o(t1) = 0 
2(0) =0 g(r) = 0 ee 


62(0) = go(ti) = 0 


These boundary conditions are also the ones corresponding to fixed 
starting values and fixed end values of y, ý, z, and 2, but with a variable 
interval ¢; for the transient. With the conditions of Eq. (14.37), Eq. 
(14.35) shows that the constant C to the right in Eq. (14.36) must vanish. 
The final solution to the problem of Eq. (14.30) is as follows: 


oF oF Ti oF Fi 
F — Ziy Z N Ü gi = zT? s(a) -23 — =Q (14.38) 
and i one of the following equations: 
oF d (2) d? (2) 
a a N Ero oro es 0 
dy at y di? \ dy (14.39) 


OF ad for d (oF 
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Equations (14.38) and (14.39) constitute a system of two equations for 
two unknowns y and z. They are the control equations and are the 
equations for the computer design. 

The boundary condition (14.37) defines the original criteria for that 
duration during which the system moves from one essential operating 
level to another. Thus, if all conditions of Eq. (14.37) must hold, the 
system goes from one definite y, y, z, and 2 to any other definite y, ý, 2, 
and 2. Equation (14.88) is a third-order equation. Equation (14.39) 
is a fourth-order equation. Thus besides the four initial values y, ý, z, 2 
we can still assign three values of ġ, z, 2 at the final y. That is, when 
Y = Y; ý =0,2 =2,andz2=0. A stability device must still be added 
to the system, so that 

y =0 and z2=0 
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at the final point. We see then that although the system considered 
above is considerably more involved than the first-order system dis- 
cussed in the previous sections, yet the same general approach is entirely 
applicable. 

14.7 Control Problem with Differential Equation as Auxiliary Condi- 
tion. Let us consider y to be the essential variable whose performance 
has to be controlled. z is the variable which we put in the system to 
ensure that y can be made to have the desired performance. The 
inherent dynamics of the system then gives one a relation between y 
and z. This relation is in general a nonlinear differential equation, say 


of second order, 
gly, ý gzz t = 0 (14.40) 


The performance of y during a transient is specified, say, as 
j fly) dt = min. (14.41) 


The error integral of Eq. (14.2), would be an example of this type of 
specification. The problem is to derive a control equation satisfying 
both Eqs. (14.40) and (14.41). 

The mathematical problem is then a problem of the calculus of varia- 
tions with the differential Eq. (14.40) as an auxiliary condition. This 
can again be solved by using the method of the Euler-Lagrange multiplier 
A(é).! That is, 

f : F(y,y,932,2,2;t) dé = min. (14.42) 
with 

F = fly) + O)9,9,952,2,2 1) (14.43) 
The only novel feature of the problem is the introduction of the time- 
varying multiplier X(t). The problem of Eq. (14.42) is exactly the same 
as that of Eq. (14.30). Therefore all the equations developed in the 
last section can be used. However, we now have three unknowns: y, 2, 
and à. The three equations are Eqs. (14.34) and (14.40) with F defined 
by Eq. (14.82). Equation (14.40) is intrinsic to the physical system 
and is thus automatically satisfied by the controlled system. What 
has to be artificially enforced is Eq. (14.34). That system of two equa- 
tions then forms the basis of the operation of the control computer. A 
properly constructed control computer then takes information about 
the essential output y, digests it, and then generates a continuous signal 
for z. This signal of z, when fed into the controlled system, then forces 
the system to behave according to the specification of Eq. (14.41). 


‘See for instance O. Bolza, “Vorlesungen über Variationsrechnung,’”’ Chap. 11, 
Teubner, 1909, 
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14.8 Comparison of Concepts of Control Design. In the last chapter 
and in this chapter, we have discussed methods of designing control 
systems when the performance is specified quite rigidly. The method 
of the last chapter, based upon the perturbation theory, is applicable to 
linear systems with time-varying coefficients. The method of this chap- 
ter is even more general in that the system to be controlled may itself be 
nonlinear. For such general systems, these newer methods are the only 
available tools for designing the control; and the resultant complicated 
control systems involving electromechanical computers seem to be the 
only logical solution. However, the methods of the last two chapters are 
equally applicable to the simpler physical systems treated in the earlier 
chapters, 2.¢., linear systems with constant coefficients. For such simpler 
systems, then, we have two different general approaches to the control 
problem. It is illuminating to compare these two design concepts. 

The engine-control problem was treated in Chap. 5, using the con- 
ventional principles of servomechanism. In this chapter, we have dis- 
cussed almost the same problem with the new design method with 
specified performance criteria. One point is immediately clear: the 
control system designed by the older method is linear, and the control 
component can be a simple RC circuit; while the control system designed 
by the new method is nonlinear, and the essential control unit is a 
computer which even in its simplest form is much more complicated 
than an RAC circuit. But this complication is not introduced without a 
gain: while the control system based upon the principle of the conven- 
tional servomechanism may be entirely satisfactory in performance, the 
control system involving a nonlinear computer is guaranteed to give the 
optimum performance—no other control system can be better under 
the same design specification. But this comparison has meaning only 
if we know specifically the desired performance. For instance, if we 
do not know exactly what the temperature integral in Eq. (14.18) is, we 
cannot apply the method discussed in this chapter at all. On the other 
hand, to design a satisfactory control system according to the older con- 
cepts of servomechanisms requires no such sharply defined specification. 

Of course, strict insistence on the best performance must come after 
a clear understanding of what constitutes optimum control behavior. 
Therefore, when we do want an optimum control system, we naturally 
should have the information to define sharply the design criteria. From 
this point of view, then, the newer principles of the last chapters for 
control systems with specified performance certainly go one step beyond 
the conventional theory of servomechanisms and are principles for more 
advanced control design. That the more advanced control system 
should also be more complicated is to be expected. 


CHAPTER 15 


OPTIMALIZING CONTROL 


In the previous chapters we have discussed the design principles of 
control systems with increasing degrees of generality and complexity. 
However, one basic assumption was made throughout the treatment: the 
properties and characteristics of the system to be controlled were always 
assumed to be known. In the case of conventional linear servomecha- 
nisms, the transfer functions of the servos and other components are 
specified before the design. In the case of linear systems with time- 
varying coefficients, we take the example of the guidance system for the 
long-range rocket vehicle. There, the dynamic and the aerodynamic 
properties of the rocket were determined previous to the design of the 
control system. In the case of the general system control according to 
specified criteria treated in the last chapter, the response of the system to 
variations in the controlled input is again predetermined. The control 
design is thus based upon this knowledge of the properties of the system. 
The feedback merely conveys the information on the state of the output 
to the “computer.” The computer then uses its built-in knowledge of 
the system properties to generate the “Intelligent” control signal. 

In this chapter, we wish to relax even this seemingly elementary 
requirement for control design. We wish to introduce the principle of 
continuously sensing and continuously measuring control systems where no 
exact knowledge of the properties of the controlled system is necessary 
for the design. Instead, the properties of the controlled system are 
measured during the control process. In particular, we shall discuss a 
simple case of such control systems, the optimalizing control. 

15.1 Basic Concept. No matter what degree of accuracy we can 
obtain from the control computer, the accuracy of the controlled behavior 
of a system is still dependent upon the accuracy of the information used 
in its design. If we determine the properties of the controlled system 
before we know the over-all design of the control system, as is tacitly 
assumed in the previous chapters, then extreme accuracy of the controlled 
behavior cannot be expected for two reasons: First, the manufacturing 
process always introduces small differences into supposedly identical 
objects. For instance, the wing of a rocket cannot be identical to the 
wing of the rocket model on which the wind-tunnel tests are made to 
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determine the aerodynamic properties. Therefore the aerodynamic prop- 
erties of the rocket in reality must differ slightly from the test results. 
Secondly, any engineering system is subject to small variations with 
respect to time. This may be due to the normal deterioration of the 
system caused by wear and fatigue, or it may be due to the drift of 
conditions in the environment in which the system operates. In short, 
the properties of an engineering system can never be known ezacily 
prior to the instant of actual operation of the system. Therefore, when 
great accuracy of the controlled behavior is a necessity, we must use the 
principle of a continuously sensing control system. 

Nor is the required accuracy of control the only reason for changing 
our concept of control; very often the fact that large unpredictable 
variations of the system properties may occur forces us to use the con- 
tinuously sensing system. We have already introduced this principle 
in connection with the disturbance effects of atmospheric changes in the 
guidance problem of the long-range rocket. There we used the dynamic 
behavior of the rocket itself as an instrument for continuous measurement 
of these effects. But a more illuminating example is the flight of an 
airplane through icing weather conditions. The deposition of ice on the 
surfaces of the wing and the fuselage changes the shape of these airplane 
components. Moreover, the exact manner of deposition is somewhat 
variable and cannot be predicted with accuracy. Therefore the aero- 
dynamic properties of the airplane can be profoundly altered, and altered 
in an unpredictable way, by ice. Even worse, the changes all tend to 
degrade the performance of the airplane, t.e., decrease the number of 
miles that can be flown with one gallon of gasoline. We are thus inter- 
ested in knowing the combination of engine throttle, engine rpm, and 
airplane trim that will give the maximum miles per gallon of gasoline, 
because we should fly the airplane at the optimum condition to conserve 
the strained fuel load. But just at this critical situation, our prior 
knowledge of the airplane performance is rendered useless by the ice 
deposition. Hence the only solution to this problem of cruise control 
in adverse circumstances is an automatic sensing and measuring control 
system, t.e., an optimalizing system which automatically holds the air- 
plane at the measured optimum operating conditions. 

Of course, a skilled human operator controls the performance of a 
machine on the optimalizing principle. He watches the instrument 
readings of the inputs and outputs of the machine, and then uses his 
knowledge and experience to decide in what directions the controls should 
be adjusted. The adjusted inputs bring new output readings, which have 
to be interpreted by the operator to determine whether the optimum 
operating condition is reached or exceeded. New adjustments of the 
controls will have to be made. The continuous adjustment of inputs is 
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the sensing process, and the reading of the outputs is the feedback. How- 
ever, manually controlled optimalizing systems are necessarily slow in 
response, and, for complicated systems, human skill, no matter how well 
developed, is not sufficient. Automatic optimalizing control was con- 
ceived by C. S. Draper, Y. T. Li, and H. Laning, Jr.! Its application to 
cruise control of an airplane was discussed by J. R. Shull.” 

15.2 Principles of Optimalizing Control. The heart of an optimaliz- 
ing control system is the nonlinear component which characterizes the 
optimum operating conditions. For simplicity of discussion, we shall 
assume that this basic component has a single input and a single output. 
For the time being, we shall also neglect any time effects and assume 
that the output is determined only 
by the instantaneous value of input. 
Since there is an optimum operating 
point, the output as a function of 
input has a maximum at yo and ze, as 
shown in Fig. 15.1. Itis convenient 
to refer the output and the input to 
the optimum point, and thus put the 
physical input as x + zə and the 

Xo input Physical output as y*-+y. The 

Fra. 15.1 optimum point is then the point 

z= y* =0. The purpose of an 

optimalizing control is then to search out this optimum point and to keep 

the system in the immediate neighborhood of this point. In this neighbor- 
hood, the relation between x and y* can be represented as 


y* = —kz? (15.1) 


The simplest method of obtaining an optimalizing system, in concept, 
is as follows. Let us assume that we start with a negative input, i.e., an 
input smaller than the optimum input, and that we increase this input 
at a constant time rate as shown in Fig. 15.2a. The corresponding output 
y* will first increase, then reach the optimum value, and start to decrease, 
as shown in Fig. 15.2b. The time derivative of y*, dy*/dt, is then first 
positive, decreases to zero at the point 1 (Fig. 15.2c), and becomes 
negative after that point. At the point 2, the value of dy*/dt reaches the 
critical magnitude designed into the control system so that the direction 
of variation of the input is reversed, and the input x now decreases with 
the same constant rate. y* now increases again, and dy*/dt jumps to 


! Y. T. Li, Instruments, 25, 72-77, 190-198, 228, 324-327, 350-352 (1952). C. 8. 
Draper and Y. T. Li, “Principles of Optimalizing Control Systems and an Application 
to Internal Combustion Engine,” ASME Publications (1951). 

2 J. R. Shull, Trans IRE (Electronic Computers), December, 1952, pp. 47-51. 
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positive values. At the point 3, the output reaches its maximum value, 
and dy*/dt becomes zero again. At the point 4, dy*/dt again reaches 
the critical value, and the drive for the input variation is again reversed. 
This process repeats itself, and the behavior of the system is periodic. 
The system is said to hunt around the optimum point. The period 7” 
is the hunting period. The minimum value of the output is A* and is 
called the hunting zone of the output y*. Because of the parabolic 
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Fig. 15.2 


relation of Eq. (15.1), the average value of the output with hunting is 
1A* lower than the optimum output. This difference is a loss, the 
hunting loss D*, and is the price to be paid for the control. Thus 


D* = }4* (15.2) 


Other characteristics of the system can be calculated in terms of 
A* and T*: By using Eq. (15.1), the extreme values of the input are 
+ 4/A*/k. The rate of change of the input is thus 2 ~/A*/k/T*. The 
critical value of the time rate of output change is —4A*/7*. Therefore, 
if we fix the hunting zone A* or the hunting loss D*, and the hunting 
period T*, the system is specified. The essential elements of such an 
optimalizing system are the test variations of the input, the output 
detection and differentiating device, and the switching of the input drive 
at the predetermined magnitudes. The sensing and searching for the 
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optimum point are accomplished by the forced input variations. But 
the constantly changing input also causes a small loss D of the output. 
It is desirable to make the hunting zone A* small. But small A* also 
reduces the magnitude of the critical dy*/di for input drive reversal. This 
then increases the danger of accidental input drive reversals caused by 
the unavoidable disturbances or noise in the system. Itis apparent that 
if the system is displaced from the optimum point, the recovery time is 
directly proportional to the hunting period T*. A short hunting period 
is thus desirable. But if T* is made too small, it will be difficult to 
differentiate the output variation from the hunting operation and other 
random disturbances. We shall discuss this point again in the next 
section. 

The testing input variation can also be a continuous function of time 
instead of the saw-tooth curve of Fig. 15.2a. For instance, we can make 
the input x be a combination of a slowly varying part za and a sinusoid 
of constant amplitude a and frequency w. Thus 


t = Ta + asin owt (15.3) 
Then, according to Eq. (15.1), the corresponding output y* is 


a? ; ka? 
y* = =k( z2 + Ji~ 2kxra sin wt + =y 008 (2t) (15.4) 


This output signal can be fed to a band-pass filter to remove both the 
slowly varying first term and the double harmonic of the third term. 
The filtered signal is thus —2kzr.a sin wt. Now this signal and the 
sinusoidal signal a sin wt are combined in a rectifying multiplier which 
multiplies these signals to give 


—2kra? sin? wt = — kzal — cos (Qut)] (15.5) 


and then we again remove the double harmonic. We have then, finally, 

the signal —ka’xa This signal can be used to vary the part za of the 
input, such that 

ALa g 

a— = ~ka? 


di Ta (15.6) 


Then za tends to zero with a decay time 27", 


Because of the parabolic relation between input and output, the decay 
time constant for the output is 7*. Therefore such a control system 
will also search out the optimum point and approach it asymptotically. 
The operation of this optimalizing control with continuous test signal is 
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shown in Fig, 15.38. Fig. 15.3c shows the filtered output signal, and 
Fig. 15.3d indicates the effects of the rectifying multiplier. 

When the system is operating near the optimum point, because of the 
sinusoidal oscillation of the input, the output is — ka? sin? wt. Therefore 


—2kx,a sin wt, Fundamental Component of y* 


~ 2kx a Sin? we 


—ka*x,, Output of Rectifying Multiplier 


Fig. 15.3 


there is again an output loss D* = ka?/2. For low losses, the amplitude 
of the test input should be small. But this is again limited by considera- 
tions on incidental ‘disturbances and noise of the system. The hunting 
zone A* of the output is ka?, and thus 


D* = 1a* (15.8) 
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Equation (15.7) shows that the design constant a of the input drive is 
determined by the time constant T* and D* according to the relation 


a = 4D*T* = 2A*T* (15.9) 


The rectified part of the signal given by Eq. (15.5), —ka’ta, obtained 
from the test input of Eq. (15.3), is actually a measure of the deviation 
of the input from the optimum input. The continuous drive of the 
input according to Eq. (15.6) is but one of the many possible uses of this 
signal. Evidently, this signal can also be used to give a saw-tooth varia- 
tion of z4 by using a constant rate variation of input with a superimposed 
sinusoidal oscillation, and by reversing the input drive at critical values 
of the signal |—ka2z,|. The hunting operation of this optimalizing con- 
trol then consists of two separate frequencies, a low-frequency component 
in the variation of £a, and a high-frequency component produced by the 
sinusoidal input oscillation. 

15.3 Considerations on Interference Effects. The previous discus- 
sions on the idealized optimalizing controls have shown the importance 
of reducing the amplitude of the variation of test input and the time 
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constant 7*. However the actual design is limited in this respect by the 
ubiquitous noise and interference in the physical system. In order to 
measure effectively the output variation as a result of input variation 
for testing the distance to the optimum point, it is necessary for this 
output variation with time to be made up of frequency components that 
may with certainty be distinguished from output variations due to noise 
and interference effects. The relative amplitudes of the interference 
output frequency components can be plotted as a function of the fre- 
quency. This output interference spectrum generally has a low-fre- 
quency part, the drift interferences, and a high-frequency part, as shown 
in Fig. 15.4. Between these two parts, there is generally a range of 
frequency relatively free of the noise effects. If the optimalizing control 
is designed with test variations with frequencies in this range, the ampli- 
tude of these test variations can be made small without fear of losing 
them in the background of interferences. In general then, the test func- 
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tion must be made up of input variations that are fast enough to be dis- 
tinguishable from drift interference and at the same time slow enough 
to prevent confusion with high-frequency noise. 

These considerations on the noise effects point out the difficulties 
in both types of optimalizing controls discussed in the previous section. 
The first type, with saw-tooth input drive, uses the time derivative 
of the output as the control signal. If there is random interference in 
the output, then the relative amplitudes of the high-frequency com- 
ponents will be increased by taking the time derivative; thus the range 
of frequency available for optimalizing control will be reduced. This is 
a serious disadvantage. The second type of optimalizing control, using 
the continuous sinusoidal test function, requires a wide band of noise-free 
frequencies, because in addition to the variation of the level of input, £a, 
there is the sinusoidal variation with its own higher frequency w. Thus 
if the system to be controlled has only a narrow range of noise-free 
frequencies, the two optimalizing controls so far discussed are not easily 
applicable. A better system is the so-called peak-holding optimalizing 
control to be discussed in the next section. 

15.4 Peak-holding Optimalizing Control. The input variation for a 
peak-holding optimalizing control is the same as for the first type of 
optimalizing control studied here, a constant rate variation with periodic 
reversals. The essential improvement here lies in the method of generat- 
ing the drive-reversal signal. Reversal of the input drive should occur 
when the output has passed its maximum and has decreased to a value 
approaching the hunting zone limit. This fact itself is used as the con- 
dition for producing the input drive reversal for the peak-holding opti- 
malizing control. Essentially this could be accomplished as follows. 
The output y* is measured as a voltage. This voltage is applied to a 
condenser through a gate allowing only charging but not discharging. 
Then the voltage of the condenser follows the output y* until the maxi- 
mum value is reached. When the input z is increased beyond the opti- 
mum value, the output y* decreases. But the condenser voltage will 
remain at its maximum value, and a voltage difference v exists between 
the condenser and the indicating voltage for the output. This voltage 
difference v builds up to the hunting zone limit A*. At this point a 
relay is triggered to reverse the input drive, and at the same time the 
condenser is discharged to a voltage equal to the instantaneous y*. 
Thus the operation of this optimalizing control can be represented by 
Fig. 15.5. 

The relation between the hunting zone A* and the hunting loss D* 
is the same as given by Eq. (15.2). The extreme values of the input 
are still + «/A*/k, and the input rate is again 2+/A*/k/T*. It is 
seen that the peak-holding optimalizing control has only one essential 
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frequency of output, determined by the hunting period T*, and no 
differentiation of the output is used. It is thus particularly adapted'to a 
system with a narrow noise-free frequency range. In fact a further 
improvement in this direction could be made by basing the input drive 
reversal not directly on v, the voltage difference between condenser and 
output indicator, but the integral of v with respect to time. Then 
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the high-frequency interference effects will be suppressed, and the hunt- 
ing zone and hunting loss can be further reduced without accidental input 
drive reversal. 

15.5 Dynamic Effects. In the discussions of previous sections, we 
have assumed that the input-output relation is specified by Eq. (15.1) 
and is independent of the rate of change of the input or the higher time 
derivatives of input. This is true if the response of the output to input 
variation is instantaneous without the slightest time delay. In any 
physical system, this is not possible; there are always the inertial and 
other dynamic effects. We have then to consider the output y* given 
by Eq. (15.1) as the fictitious “potential output” but not the actual 
output y measured by the output-indicating instrument. y* is equal to y 
only when the time constant T* of the optimalizing control approaches 
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infinity. The relation between y* and y is determined by the dynamic 
effects. But we have seen previously that such dynamic effects can be 
closely approximated by a linear system. If the optimalizing control is 
to be applied to an internal-combustion engine, as was done by C. 8. 
Draper and Y. T. Li, the potential output is essentially the indicated 
mean effective pressure of the engine, while the actual output is the brake 
mean effective pressure of the engine. The dynamic effects here are 
mainly due to the inertia of the piston, the crankshaft, and other moving 
parts of the engine. For small changes in the operating conditions of 
the engine, such dynamic effects can be represented as a linear differential 
equation with constant coefficients. Since we have set the reference 
level of input and output at the optimum input zo and the optimum 
output yo, the physical potential output is y* + yo, and the physical 
indicated output is y+ yo. Thus the relation between the physical 
potential output and the physical indicated output can be written as an 
operator equation 


(y a5 Yo) =F, (2) (y* F Yo) 


where F, is generally a quotient of two polynomials in the operator d/dt. 
In the language of Laplace transforms, Fe(s) is the transfer function. 
Let us call the linear system which transforms the potential output to the 
indicated output used for controlling the input variation, the output 
linear group of the optimalizing control. Then Fe(s) is the transfer 
function of the output linear group. By implication, however, when the 
dynamic effects are negligible, or when s = 0, the potential output is 
equal to the indicated output. Therefore we have the condition that 


F0) = 1 (15.10) 


Then the operator equation between the potential output and the indi- 
cated output can be simplified because yo is a constant. That is, 


y= F, (2) y* (15.11) 
In a similar manner, we can introduce a “potential input” z* which 
is actually the forcing function generated by the optimalizing control 
system but not the actual input z. The relation between z and x* 
is determined by the inertial and dynamic effects of the input drive 
system. This input drive system we shall call the input linear group 
of the optimalizing control. And the operator equation between the 
potential input z* and the actual input z is 


per (2) 5 (15.12) 
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F,(s) is thus the transfer function of the input linear group. Similarly 
to Eq. (15.10), we have 
FO) = 1 (15.18) 


Thus the block diagram of the complete optimalizing control system 
can be drawn as shown in Fig. 15.6. The nonlinear components of the 
system are the optimalizing input drive and the controlled system itself. 

The general relation between the input z and the output y is then 
determined by the system of Eqs. (15.1), (15.11), and (15.12) and by the 
particular optimalizing input drive adopted. For instance, if the 
optimalizing input drive is of the peak-holding type discussed in thelast 
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section, then the potential input x* is a saw-tooth curve with period 2T 
and amplitude a, as shown in Fig. 15.7a. Let 


2 
wo = 7 (15.14) 


Then z* can be expanded into a Fourier series, 
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According to the general relation of Eq. (2.16), the actual input v, given 
by Eq. (15.12), can then be calculated as 


_ 8a (—1)" Qn H1, \ Ee 


n=0 
ntl. 
-n(- EL ine 3 l (15.16) 


The potential output y* is given by Eq. (15.1). Using Eq. (15.16), we 
have 
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By again applying Eq. (2.16), we have, finally, the indicated output y, 
according to Eq. (15.11), 
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Equations (15.17) and (15.18) clearly indicate that the hunting period T 
of the output is only half of the period of the input variation. This is, of 
course, to be expected from the basic parabolic relation of input to output. 

The average value of y with respect to time, being here referred to 
the optimum output y., gives the hunting loss D. Equation (15.18) 
shows that this average value is the sum of second and third terms of that 
equation with n = m. Thus, noting Eq. (15.10), 


32a?k 1 mM+i. \P 
p= mY ar iy r (Z io) 


This equation can be easily checked by observing that when the dynamic 
effects are absent, F; = 1; then the series can be easily summed, and 
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D = D* = a’k/3 = A*/8, as required by Eq. (15.2). Equation (15.19) 
also shows that the average output and the hunting loss are independent 
of the output linear group. This is, of course, to be expected, since the 
level of output is determined by the input z, and is not influenced by the 
dynamic effects of the output linear system. Only detailed time varia- 
tion of the output is modified by the dynamics of the output linear group. 
In the case of an internal-combustion engine, the output level is the 


(a) 


NSAN 


Effective Output 


Critical Voltage Difference 


2 ] 2 
zw mien? T a "y a 4 j t 
(e) i KAULS GUL K 
ý Voltage Difference Input Drive Reversal 


Between Condensor 
& Output Indicator 


Fre. 15.7 


power of the engine. The dynamics of the output linear group is deter- 
mined by the inertia of the moving parts. The power of the engine is 
certainly independent of the inertia of the moving parts. 

The numerical calculation of the output y from Eq. (15.18) for general 
input and output transfer functions is rather difficult. However, any 
practical design of an optimalizing control usually has a rather long 
bunting period T to avoid the high-frequency interference. Then the 
dynamic effects, although not negligible, are not large. In other words, 
we can assume that the ratios of the time constants of the input and 
output linear groups to the hunting period are small, and carry out the 
analysis accordingly. For instance, if the input linear group is approxi- 
mated by a first-order system with the time constant 7;, i.e., 
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F;(s) = (15.20) 


1 
and if r;is small in comparison with T, then the nondimensional quantity 
rw is also small. Under this condition, the first few corresponding 
harmonics of the series of Eqs. (15.15) and (15.16) will have practically 
the same amplitudes, according to Eq. (3.14). The only difference 
between these corresponding lower harmonics in z* and in z is a time 
shift of magnitude r; Therefore, for regions of x* and z away from the 
drive reversal points where the curvatures of x*(¢) and x(t) curves are 
small and where the values are determined mainly by the first few har- 
monics, the z(t) curve merely lags the x*(¢) curve by r; without change 
in magnitude. In going from «* to x, the sharp corners will be rounded 
off, but the general shape of the curves remains unchanged, as shown in 
Fig. 15.7a. If the output linear group is also approximated by a first- 
order system with a characteristic time rə, then similar considerations 
will show that in going from y* to y, the pattern of the curves remains 
the same, but y lags behind y* by ro. This fact is shown in Fig. 15.70. 

With the input transfer function given by Eq. (15.20), the hunting 
loss can be calculated by Eq. (15.19). Thus 
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and, by using the well-known expansion for the hyperbolic cotangent cited 
on p. 90, 


1 > T dry T 
» T+ On $1) %(rieo0/2)? Trw cot a a =| 
n=Q 


Therefore, by substituting the period T from Eq. (15.14), we have, 
finally, 
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When the time lag 7; is much smaller than T, the hyperbolic cotangents 
are approximately equal to unity. Then 
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Since the input amplitude a can be expressed in terms of the input 
drive speed and the period T, Eqs. (15.21) and (15.22) give the hunting 
loss D in terms of input drive speed and the hunting period T, for a 
peak-holding optimalizing control with a first-order input linear group 
of lagz;. These equations apparently indicate that a decrease of hunting 
loss is caused by the lag of the input linear group. However, it is decep- 
tive: for a given critical voltage difference v for input drive reversal, 
determined by considerations of noise and interference, the hunting 
period T and hence the amplitude a will be much larger with the time 
lags r; and 7, than without the time lags. The net result is an increase 
instead of a decrease in hunting loss. 

15.6 Design for Stable Operation. Stability of any control system 
means that the design performance of the system will be obtained even 
with the presence of internal and external disturbances. We have seen 
how this requirement is satisfied in the case of conventional servo- 
mechanisms and other more general control systems in the preceding 
chapters. For optimalizing control systems, the essential part of the 
operation is the proper coordination of the input drive with the output 
behavior, so that the output stays within a close neighborhood of the 
optimum. This operation must not be influenced by intemal and 
external disturbances. When this is achieved by a good design of the 
system, we have stable operation. 

For a peak-holding optimalizing control, we have described the input 
drive signal as the result of the charging and discharging of a condenser 
by a voltage representing the magnitude of the output. The input drive- 
reversal signal is given when the voltage difference v between the out- 
put indicating voltage and the condenser builds up to a critical value 
because of decreasing output. At the instant of reversal of the input 
drive, the condenser voltage is reset by discharging to the output indicat- 
ing voltage at the same instant. When there are dynamic effects, the 
time lags of the input and output linear groups will cause the output to 
continue to decrease even after the signal for input drive reversal has 
been given. Then the voltage v again builds up and will be removed 
only after the output has risen to a value equal to that at the instant of 
signaling for input drive reversal. This is shown in Fig. 15.7c. This 
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positive spurious voltage v between the instants 1 and 2 (Fig. 15.7) is 
undesirable because of the danger of tripping the input drive control 
during this wrong interval of time. To greatly reduce the positive 
spurious voltage, the condenser voltage is reset at the instant of input 
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drive reversal to a lower voltage than the instantaneous output indicat- 
ing voltage. Thus during an interval of time after the reset, the con- 
denser is charged by the output indicating voltage. The capacity of 
the condenser and the resistance of the electric circuit are so chosen 
as to make the voltage of the condenser approach that of the output 
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indicating voltage when the output is increasing again. The variation 
of voltages is shown in Fig. 15.8. The dangerous positive spurious 
voltage difference is thus greatly reduced (Fig. 15.8)), and the stability 
of the control system improved. 

We have already stated that the reduction of the E zone and 
hunting loss is limited by the interference and noise of the system. 
This is again a problem of stable operation: we do not wish to have 
false signals for reversing the input drive caused by interference. Such 
false signals will occur if the critical voltage difference for input drive 
reversal is too small. This can be shown by Fig. 15.9 where the 
output y contains a high-frequency sinusoidal noise. It is easily seen 
that if the critical voltage difference is too small, the noise will trip 
the input drive in an erratic manner. For stable operation, the critical 
voltage difference must be larger than the amplitude of the interference. 
Thus the hunting loss of an optimalizing control cannot be less than the 
interference or noise of the system. Of course, if the interference 
actually were a pure high-frequency sine wave of constant amplitude, as 
shown in the figure, it could be removed by.a filter, and a much smaller 
hunting zone could be used. In fact, if the interference or noise has any 
definite pattern at all, we can design a proper filter to ameliorate this 
limitation. 


CHAPTER 16 


FILTERING OF NOISE 


In all the previous discussions with the exception of the last chapter, 
we have tacitly assumed that the control system does not generate noise 
and interference, so that theoretically there is no limit to the accuracy 
of control. In the last chapter, we have shown that noise and inter- 
ference in fact obscure the output signal used for the optimalizing control 
and are the fundamental design restrictions in such control systems. But 
noise and interference are present in any engineering system, because 
even the “perfect” systems have thermodynamic fluctuations. Their 
effects on the control system are negligible only if the signal is relatively 
strong in comparison to the interference. For the optimalizing system, 
to minimize hunting loss of the output, we design for “weak” signals, and 
thus the problem of noise is of paramount importance. In general then, 
whenever the control signal is weak in comparison to interference, the 
effects of noise and interference cannot be neglected. 

The disturbing influence of noise on the control system can be mini- 
mized by introducing a proper device which will “filter” out the noise 
as much as possible without reducing the strength of the signal. This 
subject of noise filtering is the theme of this chapter. We shall first give 
a discussion of the theory of optimum linear filters developed by N. 
Wiener! and A. Kolmogoroff.? Later parts of this chapter will treat 
the various applications and extensions of this very powerful theory. 
The concepts and mathematical tools introduced in Chap. 9 on random 
inputs are very useful in our present discussion. 

16.1 Mean-square Error. Let f(t) be the control signal and n(t) be 
the noise. The input z(t) to the filter is thus 


a(t) = ft) + n@) (16.1) 


The output from the filter is y(t), as shown in Fig. 16.1. If the filter is 
a linear filter and the differential equation for the output-input relation 
is a linear differential equation of constant coefficients, then the filter 
properties arte completely determined by its transfer function F(s). 


' 1 N. Wiener, “The Extrapolation, Interpolation, and Smoothing of Stationary Time 
Series with Engineering Applications,” John Wiley & Sons, Inc., New York, 1949. 
2 A. Kolmogoroff, Bull. acad. sci. U.R.S.S., Ser. Math., 5, 3—14 (1941). 
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When F(s) is known, the response h(t) of the filter to a unit impulse 
is given by Eq. (2.18). If F(s) has poles only in the left-half s plane, 
we can write 


h(t) = = [- eF (s) ds = = [- etk (tw) dw (16.2) 
The output y(t) due to the input x(t) of Eq. (16.1) is then 
y(t) = i _ oln)ht — n) dy 
assuming that the input extends far into the past. Lett — ņ = r; then 
y() = f a(t — r)hlr) dr (16.3) 


Let the desired output be z(t), which is determined by the signal f(¢) 
and the desired impulse response h(t), t.e., 


2(t) = iA ° At — 2)halr) dr (16.4) 


hi(t) can be calculated from the desired transfer function F(s), and we 


Frc. 16.1 


have, similar to Eq. (16.2), 


TE = i One = L etp lio) dw (16.5) 
Since the actual output is not 2(¢) but y(t), the error e(t) is their differ- 
ence, and, according to Eqs. (16.3) and (16.4), 
e(t) = y(t) — a(t) 
= f° UE- Hnt- ohe) — fE — rhil) dr (16.6) 


The square of the error is then 
eA = f° h E-A) në- Ahe) — fE- Ahe) 
FE — r) + nt — rhal) — f(t — 7')Aa(7’)} dr dr’ (16.7) 


We shall now make a very important assumption. Since the noise 
n(t) is a random function, only its statistical properties can be specified. 


FILTERING OF NOISE 233 


In addition, we do not really know in advance what the signal f(z) will 
be, but only its broad character. Therefore, even for the signal we can 
specify only the statistical properties. Then we can specify the sta- 
tistical error to be measured by the assembly average of e?(t). In 
general this average is a function of the time instant t. But if we make 
the assumption that the random functions f(t) and n(t) are stationary 
as defined in Sec. 9.1, then e? is independent of time. Furthermore, we 
shall assume that both the signal f(t) and the noise n(t) have zero mean 
value. Then it is evident from Eq. (16.6) that the mean value of the 
error e(t) also vanishes. Now we can introduce the following correlation 
functions between the signal and the noise, entirely similar to the corre- 
lation functions of Sec. 9.2: 


Jeo rf — r) ) 
LEST — r’) ) (16.8) 
n= =r) = Rade RF) 

( ) ) 


t / 


Manto e) Shae 
Here we take the general case of nonvanishing correlation between the 
signal and noise. Very often these cross-correlation functions Ry, and 
Rn are zero, and only the auto-correlation functions Rj; and Ran remain. 
The auto-correlation functions are symmetrical functions of the argu- 
ment. The cross-correlation functions are not symmetrical functions, 
but they have the following relations according to their definitions 


Rent’ -7) = Ras(t — 7’) 

16.9 

Rul — T) = Ril — 7’) oe) 

By using these definitions, the mean-square error e? from Eq. (16.7) can 
be written as 


Bm f h Rit — OR) — hea) — halo] 


+ Bglr — r'\[h(r) — Aa(r)\h(r oo op — r')A(r)[h(r’) — hiir’) 
Ran(t — r')h(r)h(r')} dr dr’ (16.10) 


This equation allows the calculation of the mean-square error from the 
correlation functions and the response to a unit impulse. 

Control designers, however, prefer to make the analysis directly with 
the transfer functions F(s) and F(s). To do this, we introduce the 
Fourier transforms of the correlation functions. Let these Fourier 
transforms be ylw), Byn(w), Pay(w), and Brn(w), respectively, defined by 
the following equations: 
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i a 
rlw) = = i : Ryz(r)e-" dr 


Pn(w) = : | Ryn(r)e—" dr 

a (16.11) 
Parlo) = Af . Ras(t)e-" dr 
Panlo) = : l 7” Ran(r)e-™" dr 


Because of the fact that R,,;(7) is a symmetrical function of r, we can write 


UE l ae 
slw) = if Rys(r) Cad + ee) dr = 2 | R,;(r) cos wr dr 


By comparing this equation with Eq. (9.23), we see immediately that 
the function $;;(w) is actually the power spectrum of the signal f(¢). 
Similarly, @nn(w) is the power spectrum of the noise n(t). Furthermore, 
since the cross-correlation functions are related as shown in Eq. (16.9), 
it is easily seen that the Fourier transforms are connected in a similar 
manner: 

Pl — w) = Barlo) | 

Barlo) = Brala) | m 


According to the Fourier integral theorem,! the inverse of Eq. (16.11) 
is 

Rit =r) = 3 EC. "by (w) oO do 

harar) = T Pinloe dw 

) 

Je’ 


16.13 
Rast — r’) = e Parlow’ dw i ) 


By substituting Eq. (16.13) into Eq. (16.10), we can obtain an equation 
for the mean-square error in terms of the transfer functions F(s) and 
F(s). For instance, the first part of Eq. (16.10) becomes 


Bo ae foo ae | des BANE) — MN) — halo] 
=4 L du B, (co) h ° hlr) — hlr) dr h ” Al) — hlr jei dr 


But F(s) and Fi(s) are the Laplace transforms of A(t) and hi(t), i.e... 


! See for instance Whittaker and Watson, “Modern Analysis,” Sec. 6.31, p. 119, 
Cambridge-Macmillan, 1943. 
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F(iw) = h * alten dt | 


: (16.14) 
F(t) = i. hi(t)e* dt 


Hence the first part of Eq. (16.10) can be written as 
z fT, BoE Go) — Fuiw)|[F(— tw) — Fal iu)] de 


Other terms in Eq. (16.10) can be converted in a similar manner. We 
then obtain, finally, 


P = g f7, EOE) — Fio) — Filio) 


+ Dralo) Eio) — Fi(—t0) | w) + Bns(w)F(—tw)[Fiw) — Filie)] 
+ Bin(w)F (io) F (—iw)} dw (16.15) 


The integrand within the brace can be considered the power spectrum 
of the error e(t). The last term of the integrand is in fact the power 
spectrum of the filtered noise, according to Eq. (9.71). Evidently the 
first and the last terms are real. The second and the third terms are 
complex. However, because of Eq. (16.12), these two terms are complex 
conjugates, and therefore their sum is real. 

16.2 Phillips’s Optimum Filter Design. With the statistical proper- 
ties of the signal and the noise given as the various correlation functions, 
the Fourier transforms can be computed by using Eq. (16.11). Then 
if the transfer function F(s) is specified, the only function in the mean- 
square error integral of Eq. (16.15) yet to be fixed is the transfer function 
F(s) of the filter. The optimum filter is then the filter which has a 
transfer function F(s) such that the mean-square error is the minimum 
for specified #’s and Fy(s). A straightforward method for solving this 
problem of optimum filter design is to assume a reasonable form for 
F(s), but with undetermined constants. Then e? can be determined as a 
function of these undetermined constants by substituting the assumed 
F(s) into Eq. (16.15). The constants are finally determined by requiring 
that the mean-square error must be a minimum with respect to these 
constants. The optimum filter design is thus reduced to a problem of 
finding the maximum or minimum of a known function. R. S. Phillips,! 
in fact, worked out such a theory of the optimum filter by taking F(s) 
to be a ratio of two polynomials in s. This form of F(s) is indeed a 
natural one, because our experience of linear systems indicates just such 
a choice. The actual calculation involved, however, is considerable. 
For this reason, the more elegant theory of Wiener and Kolmogoroff is 


1 R. S. Phillips, “Theory of Servomechanisms,”’ MIT Radiation Laboratory Series, 
Vol. 25, Chap. 7, McGraw-Hill Book Company, Inc., New York, 1947. 
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generally preferred, and we shall not pursue the Phillips theory any 
further here. 

16.3 Wiener-Kolmogoroff Theory. The theory of the optimum filter 
by Wiener and Kolmogoroff is based upon an application of the calculus 
of variations to the integral of Eq. (16.15). If F(s) is indeed the optimum 
filter with fixed 6’s and F(s); then by forming the neighboring function 
F(s) + n(s) where n(s) is the arbitrary variation and by substituting this 
neighboring function into Eq. (16.15), we find the first-order variation 
of the mean-square error as 


a 


de? = f° n(— te) {F(G)[Bjp(0o) + Erlo) + Basla) + Banla) 
— Fy(iw)[Bss(w) + Brz(w)]} dw 
+i |, Mio) {F(—iee)[Bjp(w) + Brala) + Barla) + Saale) 
— Fy(—iv)[Bjr(w) + Sja(w)]} deo (16.16) 


If F(s) is the optimum filter transfer function, then the variation Se? 
should vanish for arbitrary (s). This condition will yield an equation 
for F(s). However, before we can actually take this step, we have to 
make some very important modifications of Eq. (16.16). 

First, the power spectra ®,;;(w) and ®,.(w) are symmetrical functions 
of w. Thus by taking into account Eq. (16.12), we see that the sum of 
Gr, Brn, Pas, aNd Pan 18 an even function of w. It is thus reasonable to 
expect that this sum & can be “factored” so that 


Pw) = Brp(w) + Byn(w) + Pay(w) + Pan(w) = V(tw)V(—iw) (16.17) 


W(s) is by definition a function with poles and zeros in the left-half 
s plane. W(—s) then is a function with poles and zeros in the right-half 
s plane. Now the transfer function of the filter, for reasons of stability, 
can have poles only in the left-half s plane. Thus F(s) and n(s) are 
functions with poles only in the left-half s plane. F(—s) and n(—s) 
are functions with poles only in the right-half s plane. Thus the physical 
requirements limit the class of functions for F(s) and n(s). With this 
understanding, we can rewrite Eq. (16.16) as 


@ =i a E E | Plueio) a Paeete E S| ia 


+4 | niia) |F iuni 
_ Elato) [rlo + Pfn(e)] 
V (tw) 


However, not all of the second terms in the braces of Eq. (16.18) are 
important. If H(s) and K(s) are functions with poles in the left-half 


l deo (16.18) 
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s plane, and if for large s they behave like 1/s*, with n > 1, then 
in H(iw)K (iw) dw = if H(s)K(s) ds 


where the closed path of integration of the second integral is the imaginary 
axis and the semicircle enclosing the right-half s plane, as shown in 
Fig. 16.2a. But the singularities of H(s)K(s) are outside this path, and 
hence the value of the integral is zero. For the product H(—1w)K(—iw) 
with singularities in the right-half s plane, the path of integration can be 
made to enclose the left-half s plane, as shown in Fig. 16.2b. Then 


s- plane 


Poles in this 
Half Plane 


Poles in this 
Half Plane 


(a) (b) 
Fre. 16.2 


again no singularities will be enclosed, and the value of the integral will 
be zero. For products H(iw)K(—iw) or H(—iw)K(iw), the path will 
always enclose some singularities, and the integral has value. Thus 
I, H (iw) K (iw) do = m H(—iw)K(—iw) do =0 (16.19) 
But 
[7 Hie) K(—iw) dw # 0 


a (16.20) 
fT, Hie) K Gin) des = 0 


With these facts in mind, we divide F'1(s)$,;(s/2)/W(—s) into two parts: 
one part with singularities in the left-half s plane, denoted by [ J., and 
another part with singularities in the right-half s plane, denoted by [ JL. 
That is, 


Fy (tw) {Byp(w) + Parlo) _ k (iw) {P¢p(w) + B | 
P(— iw) V(—t) + 


n Eo n BA 16.21) 
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Then because of Eqs. (16.19) and (16.20), Eq. (16.18) can be written as 


i] 


ia =i J aei) 


OLG — Pego Bar| | dw +4 a n(tw) V (iw) 


[riori 7 [Edie af Sle) | | do (16.22) 


Now if F(s) is indeed the transfer function of the optimum filter, then 
õe? should vanish for arbitrary (s). Thus the quantities in the brackets 
of Eq. (16.22) must vanish. This condition determines the optimum 
transfer function as 

F(s) = 1 Fale {®,;(s/2) A 6,;(8/2) a (16.23) 


¥(s) v(—s) 


This is the solution of the optimum filter problem given by Wiener and 
Kolmogoroff. F(s) has poles only in the left-half s plane, since ¥(s) has 
zeros only in the left-half s plane. F(s) is thus a stable transfer function. 
The operation of picking the part of Fi(s){y;(s/2) + S,y(s/z) }/¥(—s) 
with poles in the left-half s plane can be also done analytically. In fact, 


F(s) = =e i ome di a Pg Ee Baal go do (16.24) 


The validity of this equation can be easily verified by contour integra- 
tion in the complex w plane. Which of the two equations (16.23) and 
(16.24) is to be used for actual calculation depends upon the individual 
cases. Usually, however, Eq. (16.23) is much easier to use. In any 
event, the properties of the optimum filter are completely determined 
by the specified operation F(s) and the spectra ©;;, Pin, Bay, and Pax of 
the signal and noise. When noise is absent, Pan = By, = ns = 0, and 
Sylo) = V(tw)¥(—tw), according to Eq. (16.17). Then F(s) = F(s) 
as expected, and the error e(t) is always zero. When there is noise, F(s) 
is not equal to Fı(s), and the mean-square error cannot be eliminated 
even with the best filter. 

A different interpretation of the operation of picking the part of a 
function with poles in the left-half s plane can be given. As shown by 
Eg. (16.2), we can consider F(iw) to be the Fourier transform of the 
response A(t) to a unit impulse. That equation also shows how to 
calculate h(t) from F(iw). Because of the factor e*t in the integrand of 
Eq. (16.2), the proper paths to take in the complex w plane are different 
for positive £ and for negative ¢, and are shown in Fig. 16.3. If F(s) 
has poles only in the left-half s plane, then F(iw) has poles only in the 
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upper-half w plane. If F(s) has poles only in the right-half s plane, then 
F(tw) has poles only in the lower-half w plane. Thus for t > 0, the 
path will enclose poles of F(iw) only if F(s) has poles in the left-half 
s plane, and in that case the response h(é) will be different from zero. 
For ¢ < 0, the path will enclose no pole of F(iw) only if F(s) has all 
its poles in the left-half s plane, and then A( = 0. On the other hand, 
if F(s) has poles in the right-half s plane, the case of unstable F(s) accord- 
ing to the present interpretation, h(t) will be different from zero even 
for negative ż& Since the impulse is applied at ¢ = 0, response at nega- 
tive ¢ means response before the impulse is applied. Such behavior is 
Contour for ¢>0 


Enclosing Upper 
Half Plane 


\ 
Complex 


weplane S _-~* \ Contour for £ <0 


Enclosing Lower 
Half Plane 


Fia. 16.3 


impossible for any physical system. Therefore we can consider the 
operation of picking the part of function with poles only in the left-half 
s plane as the operation of making the transfer function physically 
realizable, so that h(t) = 0 for? <0. An explanation of the Wiener- 
Kolmogoroff solution of Eq. (16.22) and (16.23) based upon this concept 
of a physically realizable transfer function was given by Bode and 
Shannon. 

There remains one point to be cleared. One of the assumptions in 
the present theory is the possibility of factoring the sum of spectra as 
shown in Eq. (16.17). This is not always possible for an even positive 
function @(w) of w. To make factoring possible, #(w) must satisfy the 
Wiener-Paley criterion,” oe 


“ llog &(w)| . 
bab Beh © 16.25 
f 1 +w’ dw < ( ) 
Actually @(w) is either a constant, as incase of white noise, or'approaches 
zeroasw— ©. The Wiener-Paley criterion says then that the approach 
1H, W. Bode and C. E. Shannon, Proc. IRE, 38, 417-425; (1950). 
? R. E. A. C. Paley and N. Wiener, “Fourier Transforms'in the Complex Domain,” 
Am. Math. Soc. Colloquium Publication, Vol. 19, p. 17, 1934.. 
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to zero at large w should not be too rapid. An approach of the type 
w is allowed, but an approach like e~l+! or e’ will make the integral 
diverge. Thus a (w) of the latter types cannot be factored. Fortu- 
nately the spectra of the actually occurring signal and noise are generally 
ratios of polynomials of w?. Factoring according to Eq. (16.17) is thus 
usually possible. 

16.4 Simple Examples. As a simple example, we take the power 


spectrum of the signal to be 


1 
Plo) = E 


(16.26) 
The noise is assumed to be white noise. Then its power spectrum is 


flat and a constant; thus 
Pan = N$ (16.27) 


The cross-correlation functions and their Fourier transforms thus vanish, 


1.€., 
Dy, = Bry = 0 (16.28) 


Let the problem be to design the optimum filter for differentiating the 
signal, that is, Fi(s) = s. First, we note that 
4 
B(w) = Pyu) + Parlo) = ta 
w 
Thus 
(1 + nt) + nés* 
1 + s* 


Obviously to factor this function is quite straightforward. Remembering 
that Y(s) has zeros and poles only in the left-half s plane, we can immedi- 
ately write down W(s) as 


_ ms tnavVv2Vitnis+ Vl tn 


(s/t) = ¥(s)¥(—s) = 


V(s) 


s$? + /2 s+1 
Then 
Fii(s)®ps(8/t) _ 8 
¥(—s) (2 + +/2s+1)(n2s? —nvV2VW1 + nis tV FA 


m As +B p Cs + D 

$ HVS +1 nmt- nV yl pnis t Vit ni 

where A, B, C, and D are constants. It is evident that the part with 
poles only in the left-half s plane is the first term. Hence 


oo As+B 
+ 


¥(—s) ~ @ 4+ 4/2g+1 
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By solving for A and B, we have, finally, 


ee = ae | PPE E. ET 
W(s)| (=s) j myI Fn + VIF nA 
(W1+nt—n)s—nvV/2 
ws +n VVI tnts + V1 +n! 

This is the transfer function for the optimum filter. As the noise is 
reduced, n — 0, and F(s) approaches s as it should. 

Another interesting example is the case of very high noise intensity 
and a weak signal. Let the noise be white noise and the power spectrum 
of the noise ®,, be 


(16.29) 


Panlo) = 1 (16.30) 


Here again there is no cross correlation between noise and signal, and 
Eq. (16.28) remains true. Let the power spectrum of the signal be 


$;;(w) = kelo) (16.31) 


where k is a small quantity and (w) is an even function of w. If K(s) 
is the part of y(s/t) with poles only in the left-half s plane, t.e., 


xo =[o()] -4 [ ema f7 oada assy 


then since g(w) is even in w, 


e(3) = K(s) + K(-s) (16.33) 
and 


O(w) = V(iw)W(—iw) = 1 + ke(w) 
= [1+ kK(tw)][1 + kK(—iw)] (16.34) 


Therefore if F(s) represents the desired operation on the signal, the 
optimum filter is given by 


o k Fi(s)¢(s/1) 
eres RRC, Aa 


This is the second approximation for small k. The first approximation 
is even simpler, 


F(s) = k or o), (16.36) 


As a specific example, let the power spectrum of the signal be specified by 


1 
olw) = ES 
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and let F,(s) = s. Then when kis small, 


k 1 
i ae ocr ree eae 


This result checks with Eq. (16.29) by making n very large and k = 1 fnt. 
Thus under strong noise interference, the optimum differentiating trans- 
fer function is very much distorted and bears no resemblance at all to 
F(s) = 8. 

16.5 Applications of Wiener-Kolmogoroff Theory. Besides the sım- 
ple examples discussed in the preceding section, there are a number of 
very important applications of the Wiener-Kolmogoroff theory. We 
shall consider several of them in this section. 

Predicting Filters. These filters take the input z(t), the sum of signal 
f(t) and noise n(#), and give an output y(t) which is the closest representa- 
tion of the signal not at t but at t + a, where a is positive. Thus 


F(s) = e% (16.37) 


Now let us assume that the signal is a random switching function. Then 
according to Eq. (9.50), the power spectrum can be written as 


1 


Dry = ESE (16.38) 
The noise, assumed to be white noise, has a power spectrum 
Onn = n’ (16.39) 


The cross spectra vanish. Then 


E E a 


L + w? 
Therefore 
.. VIL ++ nie 
Veoh 1 + tw 
Hence 
Py (tw) yp(@) giae 


VW(—iw) (1 + iw) (4/1 Fm — nio) 


In this case, it will be necessary to use Eq. (16.24) for calculating F(s). 
First of all, for ¢ > 0, 
1 2 Filiodlo) TE I Ë getta des 
ey pee @ T Ja (1 + tw)(V/l + n? — niw) 
g` ira) 


a tVitn 
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Hence, according to Eq. (16.24), 


o —(t+a) 
F(s) = es i P| ea alee dt 
VIF nso nyl +n 


(1 + ee i ° p(t df 
(ntYltn)(V71 +n? + as) 


Therefore, finally, the optimum predicting filter of predicting time a is 
characterized by the transfer function F(s), 


et i 
F(s) = ———————- — 16.40 
(nt VI +n?) (fl + 2? + ns) oe) 
Lagging Filters. This type is similar to the predicting filters except 
that now the “predicting” time ais negative. A straightforward adap- 
tation of the preceding calculation to this case will not lead to success. 


f(t) 


Fie. 16.4 


In fact there is no linear system of finite order that will exactly fulfill 
the role of an optimum lagging filter. A more direct solution can be 
obtained by using the approximation 


on | LH (as/27) |" a<0 
Fils) = e = pa- (a/n) v = integer 


We then obtain an approximation to the optimum lagging filter. 

Servomechanisms with Noise. Let F;(s) be the transfer function of 
the forward circuit and F;(s) be the transfer function of the feedback 
circuit of a feedback servomechanism, as shown in Fig. 16.4. Let F(s) 
represent the desired operation on the signal. The problem is to find 
the optimum F;(s) with F;(s), Fi(s), and the signal and noise properties 
specified. As shown in the figure, the equivalent F(s) is 


(16.41) 


= F;(s) 
*) = TT FORE 
But according to the theory, the optimum F(s) is given by Eq. (16.23) 
or Eq. (16.24). Knowing F(s), we can obtain the optimum transfer 
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function F;(s) for the feedback circuit as 


1 1 

F(s) = FO 3 (16.42) 

Noisy Servomechanisms. In the previous paragraph we assumed the 
source of noise to be outside the servomechanism and the servo itself 
quiet. However, very often there is noise generated in the servo- 
mechanism. For instance, the system shown in Fig. 16.5 has the external 
noise n(t), and in addition an internal noise m(t) from the output measur- 
ing instrument, or an output disturbance. Here again, let F(s) be the 
transfer function of the forward circuit, F,(s) be the transfer function 


f(t) 


Fig. 16.5 


of the feedback circuit, and F1(s) be the desired operation on the signal. 
Let the Laplace transforms of f(z), nt), mÒ, yÈ, and e(t) be S(s), N(s), 
M(s), Y(s), and Z(s), respectively. Then 


Fil) (S(s) + Ns) — Fe(s)[¥(s) + M(s)]} = Y(s) 
Z(s) = F4(s)S(s) 
Then the Laplace transform E(s) of the error eð is 


F(s) 
ONG Fs) Fs) [S(s) + N (8)] 


an 


E(s) = Y(s) — Z(s) = 


F,(s)F(s) 

pao -rp ORG MO - POS) 
et us pu 
_ __F;(s)Fo(s) 
9 = TRO (16.43) 

Then we have 

1 

1- FO) = Rane 


and 
E(s) = [1 — F(s)[Fs(s)S(s) + Fy(s)N(s) — Fi(s)S(s)] 
— F(s)[M(s) + Fi(s)S(s)] 


This equation indicates that our servo problem is equivalent to the filter 
problem, with the filter transfer function F(s), and the equivalent signal 
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input S’(s) and equivalent noise input N’(s) given by 


S(s) = {Fs(s) — Fils) }S(s) + | 
N'(s) = M(s) + Fi(s)S(s) 


The original problem of optimum /;,(s) is now reduced to that of optimum 
F(s), with the equivalent signal and noise independent of the unknown 
F,(s). 

Let us assume that the original signal and noises are independent 
and only autocorrelations exist. Then we have only the power spectra 
Pss, Pun, ANd Bam. By using Eq. (16.44), the #’s of the equivalent filter 
problem are found, 


(16.44) 


ŝ 


py (:) = [F,(s) — Fi(s)[F7(—s) — Fi(—8)]Sy; (2) 
+ Fy(s)Fy(—s8)@m (:) 
Dyin! (:) = [F;(—s) — Fi(—s)]Pi(s) Sy, (:) (16.45) 


(7 


Puy (:) = F,(—s)[Fs(s) — Fil) (:) 


Bata! (:) = Bim (2) + Fy(s)Fi(—s) B55 (:) 


Thus the equivalent filter problem has cross spectra yy and wy, in 
spite of the uncorrelated signal and noises in the original problem. The 
function to be factored is then, using Eq. (16.45), 


Plo) = V(iw)V(—1w) = Fy(tw)F's(— iw) {rlo + Panlo} 


+ Bnm(w) (16.46) 
The optimum F(s) is, according to Eq. (16.23), 


F(s) = me 
ON SAS = Oe) (16.47) 
W(—s) + | 


When F(s) is known, Eq. (16.48) gives the optimum transfer function 
F(s) for the feedback circuit as 


1/F;(s) 
Fils) = —o 16.48 
8) = ro- ee 
Saturation Constraint. Consider the servomechanism of Fig. 16.6, 
designed to make the output y(¢) follow as closely as possible the input 
f@) = x(t). The transfer function of the amplifier is F.(s), and the 
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transfer function of the servo motor is F,(s). If the design condition 
is to make the mean square of the error e(t) = y(t) — f(t) as small as 
possible by varying F,(s), then the power of the control input to the 
servo motor may reach a very high level during operation. To avoid 
this, we must specify that the average of the power of the input to the 
motor must be a specified value. The mean-square error in this case is 


3a] l i l Falio)Pnlio) 1 
2 J o| 1+ Falta) Fn (tw) 


Fa(—iw)Fn(— iw) 
oe = 1 Byy(w) dw (16.49) 


The average power of the input to the servo motor is represented by the 
mean square of the servo motor signal. This is to be kept at the fixed 


f(t) 


Fie. 16.6 


value o2. Then 


0 
o = 4 
— ð 


By using the Lagrange-multiplier method, this problem of minimizing 
e? with the constraint of Eq. - (16.50) can be converted into minimizing 
e? + do’, à being the constant amplifier. Therefore the integral to be 
minimized is 


F (10) 


2 “TR 
T+ Fy (iw) F'n (tea) rlw) dow (16.50) 


e + ro? = 4 I Í G (iw) — IE (— iw) — 1dr) 


+ Fliw)F(— iw) TO dw (16.51) 
where 
Fals)En(8) 


1 + F,(s8)F'm(s) 


Comparing Eq. (16.51) with the integral of Eq. (16.15), we see that 
the present problem is equivalent to the filter problem with Fi(s) = 1, 
Pr, = Paz = 0, and the equivalent noise power spectrum 


F(s) = (16.52) 


_ __ Mylo) 
Poole) = F iFa E) 
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The function (w) to be factored is thus 


P(w) = V (tw) ¥(— tw) = f + IT| Prl) (16.53) 


According to Eq. (16.23), the optimum F(s) is given by 


_ 1 | B(s/2) 
r= sq | Feo |, R 
Equations (16.52) and (16.54) determine the transfer function F,(s) of 
the optimum amplifier, with the exception of the constant A. This con- 
stant is fixed by the power level o?, according to Eq. (16.50). 

The discussion in the preceding paragraphs has perhaps demonstrated 
the wide range of problems that can be solved by the Wiener-Kolmogoroff 
theory. In fact, the problem of servomechanism design for random 
input, formulated in Sec. 9.12, can also be conveniently solved by this 
theory. Such an application to servomechanisms is another example 
of design according to specified criteria, a subject treated in its generality 
in Chap. 14. Whenever such criteria can be formulated, the optimum 
system characteristics are completely determined. Furthermore, because 
of the particular type of criteria chosen and the properties of the con- 
trolled system, the resultant control system is linear and has constant 
coefficients. This concept of a more specific design of servomechanisms 
is thus one step beyond the design principles of feedback servos discussed 
in the earlier chapters. Boksenbom and Novik! were perhaps the first 
to suggest this particular application of filter theory. 

16.6 Optimum Detecting Filter. In many control systems, the prob- 
lem is the detection of the signal f(t) under heavy noise interference. 
In this problem, the pattern of the signal is generally known; what has 
to be detected is the time instant tọ when the signal has reached the 
expected value f(t). For instance, in the case of radar, we know that 
the signal is a pulse having a specified shape. The problem is to know 
when the signal has reached its maximum strength. If the signal has its 
maximum at the instant to, then the filter should give the least distortion 
at the instant t. Therefore the optimum filter must be so designed 
that the value of the signal after filter action, f,(¢), at the instant te is, in 
fact, f(t). Thus the constraint is 


folto) = f(t.) = a const. (16.55) 
The noise input ta the filter is n(Z); the corresponding output is no(t). 
‘We wish to reduce the noise as much as possible by filtering, and thus 


we can write 
n2(t) = min. (16.56) 


1 A. S. Boksenbom, D. Novik, NACA TN 2939 (1953). 
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The problem is to determine the transfer function F(s) of the filter or 
its equivalent, the response h(é) of the filter to a unit impulse, with given 
f(é) and to, and the noise characteristics Ran OF Paa, such that 


ne(t) — 2rfo(to) = min. (16.57) 


where is the Lagrange multiplier. The solution of this problem is 
given in this generalized form by Zadeh and Ragazzini.’ We shall follow 
their analysis. 

If Snn(w) can be factored, so that 


nnlo) = V(iw)¥(—t) (16.58) 


where Y(s) has zeros and poles only in the Jeft-half s plane; then it is 
convenient to consider two amplifiers in series, where one has the transfer 


cre errant ere , 


Fria. 16.7 
function 1/¥(s) and the other the transfer function V(s), both in series 
again with the filter, as shown in Fig. 16.7. This system is equivalent 
to the original system. We consider the last two transfer functions as 
one unit F(s), t.e. 
F’(s) = W(s)F(s) (16.59) 


with the response h’(¢) to an impulse. The inputs to F’(s) are the signal 
f’( and the noise n’(f). If S(tw) is the Fourier transform of the signal 
FO), te., 

Slo) = fÀ JO at (16.60) 
then 


f 1 {° Ste) . 
FÒ = an [. a em dw (16.61) 
The power spectrum of n’(t) is now, according to Eq. (9.71), 


Panlo) z 
¥ (tw) ¥(—1w) 


Therefore the noise is now a white noise with the autocorrelation function 


Realr) = ô(r) (16.62) 
1L. A. Zadeh, J. R. Ragazzini, Proc, IRE, 40, 1228-1231 (1952). 
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The outputs from the system can now be written as 


f(t) = i hi (r) f(t — 7) dr (16.63) 
and 


no(t) = i * hirn’ (t — 1) dr. (16.64) 


The mean square of output noise is then the assembly average of n2(t). 
Or 


AD) = [ae f° arn oN WE TE 7) 
= k i K(k (t) Reelt — r) dr dr 
But the correlation of noise n’(é) is given by Eq. (16.62). Hence 


m = i. * Tal) de (16.65) 


Therefore the minimization condition of Eq. (16.57) can be converted 
by Eqs. (16.63) and (16.65) to 


h WOP at — 2 4 h'(t)f"(to — t) dt = min. 
This can be rewritten as 
ie [h’ (t) = Af (i = t)? dt — X? a [f (to = ty}? dt = min. (16.66) 


But with f(t) and ®,x(w) specified by the problem, f’(t) is a fixed function 
calculated by Eq. (16.61). Therefore the second integral of Eq. (16.66) 
is a fixed constant. The first integral is positive or zero. For the 
minimum of the sum, the first integral of Eq. (16.66) should be zero. 
This is so only if the quantity within the square bracket of that integral 
vanishes. Thus | 

KO =M tad fort > 0 (16.67) 


Naturally, for physically realizable systems h’(t) = 0 for ¢<0. In 
other words, the optimum response of F’ to an impulse is identical, 
for positive £, with the image of f’({) with respect to to/2. This result 
for white noise has been known for some time and was first derived by 
D. O. North. 

With the result of Eq. (16.67), the optimum filter F(s) of the original 
problem can be immediately written down with the help of Eqs. (16.59) 
and (16.61), 

S(—iw) 


Z À j —st : iu (t—to) 
F(s) = | dte a TEA e do (16.68) 
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where \ is to be finally fixed by the constant f(to) in Eq. (16.55). This 
result is very similar to that of Wiener’s optimum filter given by Eq. 
(16.24). In fact we can write 


A | ess) 
PO = 35 [Gow | eit 


where [| |, again denotes picking the part of the function within the 
bracket having poles only in the left-half s plane, or making the transfer 
function physically realizable. Equations (16.68) and (16.69) are the 
formulae for optimum detecting filters given by Zadeh and Ragazzini. 
The derivation is made elementary by using the artifice of an equivalent 
problem (Fig. 16.7), in which the noise component of the input is a 
white noise. This device is generally very useful in simplifying the 
analysis of complex optimization problems. 

16.7 Other Optimum Filters. One of the basic assumptions in the 
filter theory discussed in the preceding sections is the stationarity of the 
random functions, either as signal or as noise. No true stationarity can 
hold for very long time, because of the natural drift of the system or a 
purposeful change in the state of operation. More likely, the random 
inputs are stationary only for a limited time interval T. For time inter- 
vals longer than T, the random functions are not stationary. Therefore, 
if the filter is designed with the assumption of a stationary random 
function and if the characteristic time of the filter is much larger than T, 
reality will not correspond to the theory, and the performance of the 
filter suffers. In such a case, it would be better to deviate from the 
theoretical “optimum” filter and use one with a shorter characteristic 
time. 

A more satisfactory solution is to include the time T explicitly in our 
theory. We can do this by requiring the impulse response h(t) of the 
filter to be zero outside the interval 0 < t < T. Then the output y(t) 
is calculated from the input z(t) as follows: 


y(t) = i * Mrl — +) dr (16.70) 


This equation demonstrates the fact that the output is dependent upon 
the input only for a finite time 7 back from the present. Therefore such 
filters may be called filters of finite memory. The filters discussed in 
the previous sections, having T — ©, are thus infinite-memory filters. 
Finite memory filters are discussed by Zadeh and Ragazzini. They? 
give a solution of the optimum filter for the problem of detection similar 
to that of the preceding section. They? also give a solution of the 


1 Zadeh and Ragazzini, op. cit. 
2? L. A. Zadeh, J. R. Ragazzini, J. Appl. Phys., 21, 645-654 (1950). 
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optimum filter for a more complicated problem with the signal input 
composed of two parts, a stationary random function and a nonrandom 
function expressible in a polynomial of nth degree in#. For this problem 
of a finite-memory filter, the performance criteria are, first, vanishing 
mean error, and second, minimum mean-square error. The vanishing 
of the mean error is no longer automatically ensured because of the 
nonrandom part of the signal. The solutions given by their theory for 
these problems are, however, generally difficult to realize by simple 
RC circuits. In fact, even for the simpler problem of infinite-memory 
filters, the solution specified by Eqs. (16.68) and (16.69) is sometimes 
difficult to build. Practical filters can be only approximations to the 
theoretical optimum. Then the value of the theoretical solution lies 
mainly in giving a guide for design and a standard of ideal performance. 

16.8 General Filtering Problem. Of course, it will be still possible to 
use the complicated theoretical optimum filter design if we abandon the 
inadequate RC circuits and use an analog computer or even a digital 
computer to serve as the filter. Then the theoretical optimum perform- 
ance can actually be attained. However, the introduction of an electro- 
mechanical computer as a component of the filtering system greatly 
increases the complexity of the over-all system and can be justified only 
in very critical cases. If we have made the system very complicated at 
high cost, we may ask whether we have actually obtained the very best 
performance. The optimum performance in the theory discussed in the 
previous sections is only optimum within the limitations of the assump- 
tions of the theory. For instance, two random signals with the same 
correlation function or the same power spectrum, according to the theory 
developed, require the same optimum filter. This is, in a sense, a certain 
looseness in design criteria. Surely, if we have more statistical informa- 
tion about the signal than just the power spectrum, we should be able 
to distinguish these two signals and to improve our design by utilizing 
such additional knowledge. Then we can obtain even better perform- 
ance than possible with the so-called “optimum” filter. It is evident 
that this generalized approach to the filtering problem must require a 
more advanced theory of probability than we have used. The recently 
developed science of information theory may also find important applica- 
tions here. A beginning! has been made in this “ probabilistic” approach 
to the problem of detecting a signal in noise. But much remains to be 
done. 

The Wiener-Kolmogoroff theory of the optimum filter is based upon 
the mean-square-error criterion. By using this criterion, we essentially 

1 See for instance P. M. Woodward, I. L. Davies, Phil. Mag., 41, 1001-1017 (1950); 


Proc. IRE, 89, 1521-1524 (1951); J. Inst. Elec. Engrs. London, 99(3), 37-51 (1952); 
and T. G. Slattery, Proc. IRE, 40, 1232-1236 (1952). 
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put the emphasis on minimizing the large errors, without much consider- 
ation of the small errors. However, on many occasions, we may be most 
interested in making the frequent error as small as possible, while not 
being particular about an infrequent large error. It is also possible 
that the probability function is very lopsided, with the mean far from 
the mode. For such cases, the mean-square-error criterion is entirely 
inappropriate. As a simple example, we consider the problem of pre- 
dicting whether tomorrow will be a clear day, quoted by Bode and 
Shannon.! Since clear days are in the majority, and there are no days 
with negative precipitation to balance days when there is precipitation, 
the probability function is very much lopsided. With this function, the 
average point, which is the one given by a prediction minimizing the 
mean-square error, might be represented by a day with a light drizzle. 
To a man planning a picnic, however, such a prediction would have no 
value at all. He is interested in the probability of having a really clear 
day, since even a small amount of precipitation would ruin a picnic. 

The theory of the optimum filter of the present chapter also assumes 
a linear filter in that the differential equation relating the input to 
the output of the filter is a linear equation with constant coefficients. 
This is clearly a self-imposed limitation, taken with the purpose of 
simplifying the theory and with the knowledge that such filters are easily 
synthesized out of RC circuits. For controlled systems that have time- 
varying characteristics, such as the ballistic guidance problem of Chap. 
13, such filters are clearly not suitable. In this case of time-varying 
systems, the proper filter should also have time-varying characteristics. 
If the filter is still linear, so that the principle of superposition holds, the 
input-output relationship is still controlled by the impulse response. But 
now this response is a function h of two time variables ¢ and i*. tis 
the time instant when the response is taken, ¢* is the time instant when 
the impulse is applied. Then the output y(t) can be calculated from the 
input x(t) as follows: 


y(t) = | hitt — alt — 1) dr (16.71) 


The optimum filter problem is then that of first determining A(r,t) and 
then of finding a way to actually implement this optimum response to an 
impulse. 


1 Bode and Shannon, op. cit. 


CHAPTER 17 


ULTRASTABILITY AND MULTISTABILITY 


In the preceding chapters, we have indicated how control systems 
of great complexity can be designed to give almost any specified per- 
formance. Of course, the greater the complexity of the system, the 
greater is the chance of malfunction caused by mistakes in assembling 
the system or the failure of an individual component of the system. 
Therefore, for complex control systems, the problem of the reliability 
of the design in actual operation becomes one of utmost importance. In 
this and the last chapter, we shall consider this problem from two differ- 
ent points of view. 

In this chapter we shall discuss the possibility of building into the 
system a certain measure of flexibility and adaptability so that incidental 
and unexpected mistakes in design will be automatically corrected by the 
control system itself without external human aid. Such a control system 
is thus capable of learning how to behave properly and has almost the 
homeostatic mechanisms of living organisms, which enable them to survive 
under varying conditions of environment. Naturally this concept of 
self-adjustments in a complicated system came from the study of the 
behavior of living creatures, because there these characteristics are most 
evident. In fact, our discussions in this chapter are based upon a remark- 
able book by W. R. Ashby? on the origin of the nervous system’s unique 
ability to produce adaptive behavior. There may be different opinions 
about how the brain of an animal is actually constructed. But our task 
is simply to indicate that it is possible to achieve such adaptive behavior 
by mechanical means. Whether the suggested mechanism is the only 
one possible does not concern us here. 

17.1 Ultrastable System. For simplicity, let us consider an autono- 
mous system specified by two variables yı and y2, such as those considered 
in Sec. 10.5. The phase plane is then the yry2 plane. If ¢ is the time, 
then the simultaneous differential equations determining the behavior 
of the system can be written as 


os = fiyn Yt) 


ys = falyny;t) 


1 W., R. Ashby, “Design for a Brain,” John Wiley & Sons, Inc., New York, 1952. 
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Gir 
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where we have included in the functions fı and f: an extra parameter ¢, 
indicating that the functional relationships between dy:/di and dy2/di 
on one hand, and yı and yz on the other are fixed only if the parameter ¢ 
is fixed. In particular, we shall allow ¢ to take a series of discrete values. 
The pattern of behavior of the system is determined by lines of the loci of the 
point (y1,y2) as time increases, starting from various initial points in the 
phase plane. Clearly then, there are as many different patterns of 
behavior of the system as there are different values of the discrete param- 
eter¢. For instance, the linear system discussed in Sec. 10.5 (with yı = y 
and y2 = y) has the parameter ¢. If ¢ is capable of taking five different 
values, one negative, less than —1; one negative, between — 1 and 0; one 
equal to 0; one positive, between 0 and 1; and finally one positive, greater 
than 1, then the five patterns of behavior of the system are indicated by 
Figs. 10.12 to 10.16. Another example would be 


BD = aul) + ay 
J (17.2) 
T = aalt)yı + az(t)y2 | 


where the coefficients a1, G12, @21, aNd azz are monotonic functions of ¢. 
Then there are as many different sets of these coefficients as there are 
different values of the parameter ¢. Each set of the coefficients gives a 
definite pattern of behavior. 

Some of the patterns of behavior will be stable in that all lines of 
behavior tend to a point in the phase plane, the stable equilibrium point. 
Some of the patterns of behavior will be unstable in that the lines of 
behavior tend to diverge from the equilibrium point. Satisfactory per- 
formance of the system naturally requires stability. We shall achieve 
the desired adaptive behavior of the system if we can cause the system 
to reject automatically the unstable patterns of behavior and to retain 
the stable patterns of behavior. Now see what happens if we surround 
the desired equilibrium point of the system by a closed boundary and if 
we build the system with a switching device such that the parameter ¢ will 
jump to a different value whenever the line of behavior of the system 
crosses the boundary. If we start with a pattern of behavior as shown 
in Fig. 17.1a at the point Po, the pattern of behavior is unstable, and the 
system will cross the boundary at the point P;. At the instant of cross- 
ing, the switching device acts so that the parameter ¢ jumps to a differ- 
ent value, and the pattern of behavior of the system changes to that of 
Fig. 17.1b. This pattern is also unstable, and the system moves from 
point P, to Pe, where it crosses the boundary again. The switching 
device changes {, and the pattern of behavior becomes that of Fig. 17.1c. 
But this pattern of behavior, although it contains a stable equilibrium 
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point, requires the system to move out of the boundary at Ps. Thus 
the switching device operates a third time and changes the pattern of 
behavior to that of Fig. 17.1d. This is the stable pattern, and the 
system moves from the point Pz to the equilibrium point Ps. This 
pattern will be retained because, in a condition of stability, the system 
will not cross the boundary and thus will not activate the switching 
device to change ¢. 


¥. 


2 
y 
Cue 
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Therefore, with only the addition of the switching device and a pre- 
scribed switching boundary in the phase plane, the system is made to 
seek stability automatically, rejecting unstable patterns and retaining 
stable patterns of behavior. Furthermore, the switching action in 
changing the parameter ¢ can be entirely random. If the stable pattern 
is attained by the first switching, so much the better. But the end 
result is the same, whether the system has to switch once or three times. 
Stability is always achieved. Therefore we are able to obtain purposeful, 
goal-seeking behavior by purely mechanical means. Such a system is 
thus automatically stable; it does not have to be designed to have stabil- 
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ity, it will become stable by learning. Therefore it has something more 
than just stability and will be called an wltrastable system, after Ashby. 
Our two-variable autonomous system can be generalized to an n-vari- 


able system with the variables y where += 1,..., %. Then the 
differential equations can be written as 

dy; 

a = Fil ys, Yo; e. e Yip ee Yast) for = I, 2 yh (17.3) 


where ¢ is the parameter. ¢ jumps to a different value whenever the 
line of behavior in the phase space y; crosses the switching boundary. 
The switching boundary here is a closed hypersurface ofn — 1 dimensions 
in the phase space of n dimensions. Such a system is also ultrastable. 

17.2 An Example of an Ultrastable System. In order to demonstrate 
the behavior of an ultrastable system, Ashby constructed a relatively 
simple system of four variables, which he called the homeostat.! The 
four variables yı, Ya Ys and ys are the angular deflections of four magnets 
whose motion is heavily damped. The position of each magnet is con- 
trolled by four coils, each of which is fed with currents generated by the 
angular position of the four magnets. Because of the heavy damping, the 
motion of the magnets is slow, and the inertial forces can be neglected. 
The equations of motion can be obtained by equating the turning torques 
produced by the coils to the damping torques. Thus 


d 
= = A141 + aiy? + 13Y3 + Qiy 


d 
A = Qeiy1 +. Ae2Ya + M343 + Qa 

(17.4) 
TE = A311 + @32Y2 + Q33Y3 Ej Azyya 


T = G41Y1 + Y2 + Oasys F Oasys 


The magnitude of the coefficients a can be modified by the experimenter 
by regulating the current in the coils with a variable potentiometer. The 
sign of the a’s can also be changed by the commutators in the coil circuits. 
In addition, for each magnet, one of the coils is fed by current passing 
through a uniselector which has 25 possible positions. The position 
of the uniselector will jump randomly to a different one whenever the 
angular deflection of that magnet reaches 45 degrees in either direction. 
Thus four of the coefficients ax, for i, 7 = 1, 2, 3, 4, are each capable of 
having one of the 25 values chosen randomly by the four uniselectors. 
For each magnet, or each variable y; we have a block diagram similar 
to that shown in Fig. 17.2. 


1W.R. Ashby, Electronic Eng., 20, 379 (1948). 
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The switching boundary of the homeostat is thus a “cube” in the 
four-dimensional phase space centered around the origin with sides cor- 
responding to 90-degree angular motion. For every setting of the a’s by 
the experimenter, the four uniselectors give 254 = 390,625 combinations 
of the four coefficients controlled by them. Therefore, for every hand 
setting, there are 390,625 patterns of behavior of the homeostat, some 
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stable, some unstable. The unstable patterns will, however, be auto- 
matically rejected. 

The ultrastability can now be demonstrated. First, for simplicity, 
a single unit is shown arranged to feed back into itself through a single 
uniselector; the other coils are disconnected. The behavior of the single 
magnet is shown in Fig. 17.3, where the upper curve is the deflection 
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Fig. 17.3 


of the magnet and the lower curve indicates the uniselector action. At 
D., the magnet is disturbed by hand; but the uniselector position happens 
to give a stable pattern of behavior, and the deflection is promptly 
corrected. At R,, the feedback is reversed by hand. The old uniselector 
position now makes the system unstable, and the deflection of the magnet 
reaches the switching boundary (the dotted line in the figure). The 
uniselector acts. After one jump of the uniselector, the pattern becomes 
stable, as shown by the test disturbance at Do. At Re, the feedback 
is again reversed by hand. This time it takes four random jumps of the 
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uniselector to obtain stability. At Ds, the system is again shown to be 
stable. 

As the next example, we have two magnets yı and yz interacting. The 
coefficient a; is set by experimenter, and the coefficient a: by the uni- 
selector. All the other coefficients are set to be zero. For each setting 
of az, there are then 25 different patterns of behavior due to the 25 
settings of the uniselector. The results of experiment can be shown as 
Fig. 17.4, where the two upper curves indicate the deflections of yı and 
yo; the lowest curve indicates the action of the uniselector. At Dj, the 
setting of the uniselector happens to be a stable one, and the deflections 
yı and yz are in the same sense. At Ri, the sign of the coefficient aa 


Ry 


J = —— Switching Boundary 


D2 


Ro 
—— 74—Switching Boundary 


J2 


——r Time 


ck ia i 5 Siena 


Fic. 17.4 


is changed by reversing the polarity of the coil for the y: magnet fed 
with current from the yı magnet. This induces instability and causes 
the yı deflection to reach the switching boundary. One jump in the 
uniselector setting corrects the situation. At De, a test disturbance 
shows that the system is now stable, with yı and yz of opposite signs as 
expected. At Rə, the sign of az is again changed to that at D,; and 
the uniselector setting is again unstable. Now it takes three jumps 
of the uniselector to reach stability. At D3, the system is seen to be 
stable again, with deflections of y, and yz in the same sense. 

As an example of the adaptability of an ultrastable system to even 
an unforseen situation, a situation not thought of until the machine has 
been built, we consider the course of events shown in Fig. 17.5. Here 
three magnets yi, ys, and ys are interacting. The behavior of the system 
is at first stable, as shown at Dı. The deflections yı and y; are in the 
same sense, but ye is in the opposite sense to them. Now at J, we sub- 
ject the homeostat to a new, unexpected circumstance by joining the 
magnets yı and yz so that they must move together. This additional 
constraint 1s counter to the pattern of behavior of the prevailing setting 
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of the uniselector. The system becomes unstable and the consequent 
large deflection causes the uniselector to act. Three unstable patterns 
are successively rejected before the terminal stable pattern is reached, as 
shown at Do At R, the connection between the magnet yı and ys is 
broken; the system again becomes unstable and requires new action of 
the uniselector. — 


2 D; D, 


-Æ= 2- ý -~ Switching Boundary 


Yo 


Fie. 17.5 


17.3 Probability of Stability. In the previous section, we have dem- 
onstrated the characteristic adaptive behavior of the ultrastable system 
in seeking stable patterns of behavior under any circumstances. The 
question then arises: Is this searching for stability always crowned with 
success? What is the probability of success? If we taken an autono- 
mous system of n variables as specified by Eq. (17.3), each value of the 
discrete parameter ¢ gives an ordinary dynamic system. The whole 
range of the parameter ¢ then gives an assembly of autonomous dynamic 
systems of n variables. We may define the probability of stability within 
the switching boundary in the phase space as follows. Pick a point 
P(y;) of the phase space, and construct an infinitesimal neighborhood 
dV around this point P. We can then find the differential probability 
dp as the fraction of the assembly of dynamical systems that will have a 
stable equilibrium point within the volume dV. Now integrate dp over 
the phase space enclosed by the switching boundary. This then gives 
the system’s general probability of stability p corresponding to the specified 
switching boundary. 

Needless to say, the actual calculation of this general probability of 
stability is a very difficult mathematical problem. In order to gain 
some ideas about this probability, Ashby made a trial calculation for an 
assembly of linear systems of the form 


dyi ft 
z = tayi vJ =1,2,...,0 (17.5) 
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There is only one equilibrium point, the origin. The question of stability 
of the system is solved by considering the determinantal equation 


òy = l fori =] 
6; = Ofori #7 


If the roots à all have negative real parts, then the system is stable. 
In fact, the root \ is called the latent root of the matrix ay. The prob- 
ability of stability is thus the probability that the latent roots of the 
sample matrix a; belonging to the assembly will all have negative real 
parts. Ashby took the simplest possible distribution, the rectangular 
distribution. That is, each element ay of the matrix has an equal chance 
of being an integer between —9 to +9, inclusive. The sampling of ay 
was done with the aid of the table of random numbers. When n = 1, 
the probability of stability is obviously 4. For other orders of the system, 
Ashby tested the stability by using Hurwitz’s rule. His results are 
shown in Table 17.1. It is seen that the probability of stability p 
steadily decreases with increase in the order of the system. As an 
approximation, the probability is 44”. 


[az — dd] = 0 (17.6) 


TABLE 17.1 


AEEA i Ps AE 
Number Number / Per cent 
tested | found stable | stable 


320 77 24 


2 
3 100 12 12 
4 100 l 1 


The probability of stability can be increased by limiting the a,; to 
favorable values. For instance, we can make all diagonal elements 
of the matrix have zero or negative values. Then if the variables are 
not interacting, the system will be always stable. The probability of 
stability for one variable, or n = 1, is now clearly unity. When n = 2, 
the probability is $. Ashby’s test results are shown in Table 17.2. 


TABLE 17.2 


Number Number | Per cent 
tested | found stable! stable 


2 120 87 72 
3 100 55 55 


The probability of stability p is thus higher, but nevertheless it decreases 
as the number of variables is increased. From these investigations it 
seems reasonable to say that the probability of stability of a randomly 


t A. Hurwitz, Mathematischen Annalen, 46, 273 (1875). 
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constructed system steadily decreases as the system becomes more com- 
plicated. Large systems are very much more likely to be unstable than 
to be stable. 

17.4 Terminal Fields. Following Ashby, we shall now give a definite 
name to the pattern of behavior at any one setting of the parameter ¢. 
The pattern in the phase space will be called the field of lines of behavior. 
The fields will be different for different values of the parameter. The 
final stable field after the switching action on the parameter will be called 
the terminal field. The action of an ultrastable system is thus mainly 
the search for the terminal field. It is thus of primary importance to 
know the average number N of switching actions necessary to reach the 
terminal field. This number N is very simply related to the probability 
of stability p of the ultrastable system. The probability of reaching the 
stable terminal field at the first action of the switching action is evidently 
the probability p itself. The probability of not reaching the terminal 
fiedisg = 1 — p. If the switching action is perfectly random, then the 
probability that the second field will be stable is still p and the probability 
that the second field will be unstable is g. The relative probability that 
the second field will be terminal is thus pg. The relative probability 
that the second field will not be terminal is g?. Hence we arrive at the 
conclusion that the relative probability for reaching the terminal field 
at the mth switching action is pg”~?. The average number N of switch- 
ing actions for reaching the terminal field is thus 


m= 


When y is very small, for very large systems, the number N of switching 
actions necessary for reaching the stable terminal field will be very large. 
That is, the search for the terminal field will be long and will seem to 
follow a devious road. 

A terminal field may be singular in the sense that only a very small 
fraction of the lines of behavior tend to the equilibrium point, while 
other lines of behavior will diverge from the equilibrium point and cross 
the switching boundary. Such a field will be terminal only if the line 
of behavior from the switching boundary happens to be one of the small 
former group. It will be shown presently that such singular terminal 
fields are not favored. If, among all possible fields of an ultrastable 
system, there is a certain fraction of such singular fields, the fraction 
of such fields used as terminal fields is much smaller. To show this, let 
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I be the fraction of the surface of the switching boundary from which 
the lines of behavior will reach the equilibrium point. For example, for 
the fields shown in Fig. 17.1a and b, k = 0. For the field shown in 
Fig. 17.1c, k is approximately 3. For the field shown in Fig. 17.1d, 
k= 1. Now let f(k) dk be the fraction of all possible fields of the ultra- 
stable system having k between k and k + dk. f (k) is thus the distribu- 
tion function of the possible fields of the ultrastable system, and by 
definition 


i * f(b) dk = 1 (17.8) 


Then because only the lines of behavior that start from the part k of the 
switching boundary can produce a terminal field, the relative probability 
of having the terminal field with its k in the range from k to k + dk is 
kf(k) dk. Thus the distribution function g(k) of the terminal fields is 
related to the distribution function f(k) of the possible fields of the ultra- 
stable system by 


kf(k) 
12 lea oem (17.9) 
Í kif(k’) dk 
Evidently 
i ' o(k) dk = 1 (17.10) 


The terminal fields are thus definitely crowded toward larger values 
of k, as shown by Fig. 17.6. Singular terminal fields are thus not favored. 
The actual terminal-field distribution is, however, not necessarily the 
potential terminal-field distribution 
calculated by Eq. (17.9). The rea- 
son is that any equilibrium state 
within a terminal field is subject to 
random disturbances. Ifthe equilib- 
rium point in the phase space is 
closed to the switching boundary, 
then even a relatively small disturb- 
ance might put the instantaneous 
Fra. 17.6 state of the system outside the 
switching boundary and thus destroy 
that field. Therefore, for a terminal field to have a high probability of 
being retained under random disturbances, the equilibrium must have a 
central location in the switching boundary. Forinstance, as shown in Fig 
17.7 the field C is definitely more stable than the fields A and B. The 
field B contains both an unstable equilibrium point and a limit cycle. 
To put this concept of stability under random disturbances into a 
quantitative form we have to introduce the probability o of retaining a 
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terminal field after a single disturbance. If the field contains a single 
stable equilibrium point P, as shown in Fig. 17.7, and if the distribution 
function of the random disturbances from P is specified, say by a Gaussian 
distribution, then e is simply the integral of this disturbance distribution 
function over the part of the phase space enclosed by the switching 
boundary. If the terminal field contains a limit cycle S, then ø is the 
average of the probability of retaining the field, taking as the equi- 
librium point each point on the limit cycle, and weighting this prob- 
ability according to the proportion of time the system will spend at that 


(a) (b) (c) 
Fra. 17.7 


point of the limit cycle. We can then assign a probability o to each 
terminal field. Let the distribution function of the terminal fields in e 
be o(c); i.e., the probability of finding a terminal field with o in the 
range from c tog + de is o(s) de. Thus 


i aries (17.11) 
The actual terminal-field distribution function is, however, ¥(c), with 
i OE A (17.12) 


To calculate ¥(c) in terms of ¢(c), we observe that the distribution 
y(r), being the actual final terminal-field distribution, will not be altered 
by random disturbances. Secondly, after one disturbance the fields with 
a value of e between c and e + de, (e) do in relative number, will have 
the probability o of being retained and the probability (1 — e) of being 
destroyed. The total relative number of fields destroyed by one random 
disturbance is thus 


fh 1 - dW) do 


The new terminal fields are distributed according to the potential ter- 
minal fields, 7.e., according to (e). Therefore the total relative number 
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of terminal fields in the range o to e + do after one random disturbance is 
oplo) do + gle) do i (1 —o Ho") do’ 


But this relative number is proportional to y(r) de, because random dis- 
turbances should not change the final distribution ¥(c). Thus, if C is 
the constant of proportionality, 


o [owo + 90) fd — ow) de’ | = He) 


By integrating this expression with respect to e from o = Otoc=1,C 
can be shown to be unity with Eqs. (17.11) and (17.12). Therefore we 
have 


ple) + glo) [ (Lo We") da! = Wo) 


In this equation the integral is just a constant independent of e. Thus 
¥(c) is proportional to v(s)/(1 — o). Or, by using Eg. (17.12), 
m glo) 
yle) pas a E p 1 glo’) do’ (17.13) 
0 (1 — 0’) 


Equation (17.13) allows the determination of the actual terminal-field 
distribution from the potential terminal-feld distribution. Since the 
potential distribution can be calcu 
lated from the distribution of all pos- 
sible fields of the ultrastable system 
by Eq. (17.9), we can, in theory at 
least, obtain the actual terminal-field 
distribution y(r) from the specified 
properties of the ultrastable system. 
Equation (17.13) shows, furthermore, 
that the actual terminal-field dis- 

Fic. 17.8 tribution (e) is far more crowded 

toward larger values of o than (c) 

is, as expected by our previous intuitive arguments. This fact is in- 

dicated in Fig. 17.8. It should be noted that the particular type of 

random disturbances will affect only the calculation of the distribution 

function g(s), while the relationship between ¥(c) and g(c), as specified by 

Eq. (17.13), is independent of it and holds for any form of the distribution 
of disturbances. 

17.5 Multistable System. As shown in the preceding section, the 
number NV of switching actions necessary for reaching a terminal field 
is expected to be 1/p, where p is the general probability of stability 
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of the fields of an ultrastable system. Since we have concluded that p 
decreases to very small values for large systems, the number N may be 
very large. For instance, if there are 100 variables in the system, 
p = 1/21, and N = 2100 = 108°. Even if we allow for 10 switching 
actions per second, the average time for reaching a terminal field will 
still be 3 X 10” centuries. Such a long settling time may well be con- 
sidered to be infinite, and the ultrastable system will practically never 
reach a stable terminal field. Therefore, for large systems, or just where 
the principle of ultrastability for automatically reaching stable behavior 
is important, we find the concept to be impractical. 

To remedy this situation, we must increase the probability of stability 
of the system. One way to do this would be a compromise. We require 
the design of the system to be such that the fields of the ultrastable 
system are limited to those which are stable under expected operating 
conditions. Only local and minor adjustments need be made by the 
switching action. In other words, the system is designed according to 
the conventional methods without using the principle of ultrastability. 
Ultrastability and switching are introduced only when trouble is expected. 
For instance, we can design an autopilot for an airplane according to the 
principles discussed in the previous chapters. But we have an appre- 
hension that the assembly mechanic might plug the control signal from 
the autopilot to the aileron in a reverse manner, so that the aileron down 
signal actually produces aileron up motion. If the mechanic actually 
makes this mistake, then the autopilot will not stabilize the airplane; 
instead, the autopilot-airplane system will be unstable, with diverging 
rolling motion. However, the designer’s misgiving can be quieted by 
introducing ultrastability just at this point. The signal connection is 
made to switch automatically whenever the rolling motion exceeds the 
specified values, the switching boundary. Then the system becomes an 
ultrastable system of two possible fields, one stable, one unstable. A 
maximum of only one switching action is necessary to reach stability, in 
spite of the fact that there may be a very large number of variables in 
the autopilot-airplane system. The point is, of course, that we do not 
need to leave everything to chance. . We can design for stable operation 
under almost all conditions and greatly reduce the choice of fields of 
behavior by providing safeguards only for those contingencies expected 
to occur. Hence it is a compromise between the principles of conven- 
tional control design and the principle of ultrastability. 

For living organisms, no prior knowledge of the conditions of the sur- 
roundings can be assumed, and therefore it is not possible to increase 
the probability of stability by limiting the choice of fields of behavior 
of the system. Ashby discovered a different way of increasing the prob- 
ability of stability. He observed that for a very complicated system 
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containing a large number of variables, any single disturbance or change 
of operating conditions directly affects only a relatively small number of 
these variables. Thus if the variables directly disturbed can be isolated 
from the rest and form an ultrastable system by themselves, then the 
probability of stability for that particular type of disturbances can be 
greatly increased. For instance, if the number of variables directly 
influenced at any one disturbance is five, instead of 100 as assumed in the 
first paragraph of this section, the expected time of reaching its terminal 
field is only 3.2 seconds, with 10 switching actions per second. Thus 
if the 100 variables are actually divided into 20 groups of five variables 
each, forming 20 separate ultrastable systems, the total time required 
for adapting to a completely new set of operating conditions will be 
20 X 3.2 = 64 seconds. This is a tremendous reduction from the 
3 X 10! centuries required for a completely connected ultrastable system 
of 100 variables. 

Of course a system composed of 20 separate systems of five variables 
each does not have the flexibility and the rich response of a system of 
100 interacting, connected variables. If, however, any one disturbance 
involves only five variables, then separate systems of five variables can 
be made to be equivalent to the system of 100 variables by making the 
association of the variables in any subsystem change according to the 
disturbance. If the disturbed variables are yı, Y2, Y3, Ya and ys, then 
these five variables are associated to form an ultrastable system of five 
variables. If a later disturbance affects the variables y2, Ys, Y10, Ys, and 
y99, then these five variables are associated to form an ultrastable system. 
This phenomenon of ever-changing organization of variables into sub- 
systems according to operating conditions is called by Ashby the dis- 
persion of behavior. Dispersion can be actually achieved by making 
the functions fi(y1,y2, . . . ,Yxif) in Eq. (17.8) zero when the point 
(Y1,Y2, . >. Yn) is within a certain region of the phase space. Then 
those y; will be constants with respect to time and are tentatively only 
parameters with respect to other variables. In our example above, for 
the first disturbance, the point (y1,yo, . . . ,Yn) is in a region of phase 
space such that f; is zero except when 7 = 1, 2, 3, 4, and 5. For the 
second disturbance, f; vanishes except for 7 = 2, 5, 10, 98, and 99. It 
is evident that this behavior of the functions f; only means that various 
thresholds exist for the derivatives dy,/dt. Such thresholds are naturally 
to be expected in any real system. Therefore dispersion is something 
easy to obtain. 

A system of ultrastable subsystems organized with the possibility of 
dispersion is called by Ashby a multistable system. A multistable system 
obviously has the adaptive behavior of an ultrastable system simply 
because it is composed of ultrastable subsystems. However it differs 
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from the single ultrastable system of an equal number of variables in 
the time required for reaching a terminal field. The multistable system 
has a very much shorter settling time and makes the principle of ultra- 
stability practically realizable. Moreover, a multistable system, by 
making successive tentative adaptations as response to successive disturb- 
ances, shows learning by steps, or serial adaptation. This is a character- 
istic universally observed in living things. Furthermore, since the second 
and subsequent adaptations to a disturbance will certainly alter the 
parameter of the system, the recurrence of a disturbance identical to 
the first disturbance will not generally reproduce the behavior of the 
system at the first adaptation. This is indeed the dispersion of behavior. 
In other words, as the system ‘‘grows older” it becomes “wiser,” and 
its behavior is appropriate for a wider range of activities than just for 
coping with the single prevailing disturbance. 


CHAPTER 18 


CONTROL OF ERROR 


In the preceding chapter, we have shown how the principle of ultra- 
stability can make the control system insensitive to accidental errors 
and occasional failures of the components by the simple device of chang- 
ing the characteristics of the system whenever instability occurs. Since 
an ultrastable system will automatically seek stability, the control 
system, when designed, actually embodies unstable fields of behavior as 
well as stable fields of behavior. In other words, during the design of 
an ultrastable system, we make no attempt to distinguish stability from 
instability, to separate the right fields of behavior from the wrong fields 
of behavior. Errors of behavior are merely treated as a probability, but 
otherwise unspecified. In this chapter, we shall approach the reliability 
of a complex control system from a different point of view: we shall 
specifically introduce errors into the system and ask how the system 
should be designed so that the system will give satisfactory performance 
in spite of the errors. That is, we wish to know how to control the error. 

This subject of control of error is now in its early period of develop- 
ment. The control of error can be discussed for only the most elementary 
operations and the present theory is wholly due to J. von Neumann.! 
Our discussion in this chapter is then an exposition of Neumann’s work. 
Its purpose is to serve as an introduction to this very important topic 
and to indicate the need for much further investigation. 

18.1 Reliability by Duplication. It is common knowledge that the 
reliability of a system can generally be increased by the simple expedient 
of duplication. For instance, if a simple system as shown in Fig. 18.la 
has the characteristic that when it fails to operate, it merely gives no 
output, then to guard against probability of failure, we can duplicate 
the system with identical systems and put all in parallel, as shown in 
Fig. 18.16. If the probability of failure of the original system is p, a 
number between 0 and 1, then the probability of failure of each individual 
unit in the parallel system is also p. If the units in the parallel system 
are independent of each other, the failure of the parallel system will 


1 J. von Neumann, “Probabilistic Logics and the Synthesis of Reliable Organisms 
from Unreliable Components,” Printed notes of lectures given at the California 
Institute of Technology, Pasadena, Calif., 1952. 
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occur only when every unit fails. The probability of failure of the 
parallel system is thus p”. By increasing the number n of duplications, 
we can make this probability very small indeed. 

However, failure of a component of a control system generally does 
not cause zero output. Instead, the effect is much more damaging: 
the system will still give an output, but the wrong output. Then simple 
duplication of the system, as described above, will not be effective, 
because a wrong output when mixed with a correct output results in a 
wrong output. Hence, in such a malfunction of the system, as opposed 
to failure, the probability of the malfunction of the parallel system 


X(s) ¥(s) 


n-parallel 
systems 


(b) 
Fig. 18.1 


is equal to the probability of the malfunction of the single system; and no 
improvement of reliability results. Therefore the problem of control 
of error is a deeper and more difficult problem than it seems to be at 
first sight. Nevertheless, as will be seen later, the principle of duplica- 
tion, 7.¢., the necessity of increasing the number of components, remains 
basic. What has to be discovered is a new organization of these com- 
ponents for controlling the error, because the simple parallel organization 
of Fig. 18.16 is not effective. 

18.2 Basic Elements. To simplify the analysis, we shall not discuss 
the case where the input and the output are continuous functions, but 
we choose an elementary component with input and output capable 
of only two discrete values, 0 or 1, 2.¢., the input is either on (activated) 
or off, the output is either on (activated) or off. The characteristic 
of the element is then determined by the relation between the state of 
the input and the state of the output. We shall always assume a single 
output, but there may be more than one input to the element. There 
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is also a time lag between the output and the inputs in the sense that the 
output is activated only at a certain specified time interval after the 
inputs are activated. Such an element then has the properties of a 
relay circuit. 
To deseribe the characteristic of the element, we introduce four kinds 
of input: excitory input; inhibitory input; per- 
Excitory Input ————> manently excitory, or live, input; and a kind of 
input which is never activated, or a grounded 
input. They are represented by symbols accord- 
T i , ingtoFig. 18.2. The element is then represented 
by a circle with inputs on the left side and the 
Grounded Input T a single output on the right. Within the circle 
Fie. 18.2 there is a number k, meaning that the number of 
activated excitory inputs must be greater than 
the number of activated inhibitory inputs by k or more, for the out- 
put to be on. Thus Fig. 18.3a represents an element whose output is 
on only if both inputs a and b are on. It can be called an ab element. 
Figure 18.3b represents an element whose output is on if either a or b 
or both are on. It can be called ana + b element. Figure 18.3¢ repre- 
sents an element whose output is on only if the input a is off. It can 


me) ar a a 
(a) (b) (c) 


Fig. 18.3 


Inhibitory Input ~o 


be called an a~! element. Incidentally, if we consider the inputs to be the 
conditions for a given statement to be true (on) or false (off), then the 
three elements of Fig. 18.3 represent the three fundamental operations 
of Boolean algebra. In a digital computer using binary numbers, these 
elements are the basic elements for its operation. If memory is neces- 
sary for the computing operation, such a device 
can be made from the second element of Fig. 18.3 ° 
by feedback as shown in Fig. 18.4. Once the input 
a is activated, the output will be on, even if ais later 

Fra. 18.4 
turned off. 

For later discussions, having three basic elements to deal with is a 
disadvantage. However, all three are really special cases of one funda- 
mental element. Consider the element represented by Fig. 18.5, called 
the Scheffer stroke. Since the two live inputs are always excited, we 
can eliminate them from the diagram and represent the element as shown 
in the right diagram. The output of the Scheffer stroke is thus on if 
neither a nor b is on, or if either a or b is on; but the output is off if both 
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a and b are on. The three basic elements of the last paragraph can be 
constructed out of the Scheffer stroke as shown in Fig. 18.6. Of course 
both the ab element and the a + b element involve two Scheffer-stroke 
elements in series; thus the time lag will be twice as large as for the a7! 
element having only one Scheffer stroke. But since our notion of time lag 
merely indicates that inputs precede output, the exact magnitude of time 
lag is of no consequence. Operations differing only in time lag will be 
considered equivalent. Therefore all three basic operations can be 
represented by a single Scheffer-stroke element. 


(a/b) Ts em 


Scheffer Stroke 
Fig. 18.5 


a element 


Fie. 18.6 


The choice of a single fundamental element is, however, not unique. 
There can be other choices besides the Scheffer stroke. But our choice 
is convenient for the following discussions. We shall consider first the 
control of error in an operation represented by the Scheffer stroke; then 
any complex operation built up from Scheffer strokes can be similarly 
designed. 

18.3. Method of Multiplexing. Applying the concept of duplication 
for improving the reliability, we substitute any single input by a bundle 
of inpuis with n individual lines. Thus in the system of the single 
Scheffer stroke, we would have n lines for the a input, labelled a;, where 
i = 1,2, .. . ,n;nlines for the b input, labelled b; fori = 1,2, ... ,n; 
and n lines in the output bundle. We then specify the fraction ô, where 
0 < 6 < 4, such that if (1 — ô)n lines of the output bundle are on or off, 
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the output as a whole is considered to be on or off. If én lines of the 
output bundle are on or off, then the output as a whole is considered 
to be off or on, respectively. Any intermediate value is considered to be a 
malfunction. 6 is thus the fiduciary level. The problem is how to con- 
struct the system, using Scheffer-stroke elements, such that the proba- 
bility of malfunction can be reduced with a specified probability of errors 
in the input bundles and a specified probability of malfunction of the 
individual Scheffer-stroke elements. 

A first approach to the problem would be to take a line a; from the a 
input bundle and a line b; from the b input bundle, and to use these as 
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Fie. 18.7 


the two inputs to a Scheffer-stroke element. The organization of the 
system is thus as shown in Fig. 18.7. It is evident that if almost all 
lines of both input bundles are on, then almost all lines of the output 
bundle will be off. If almost all lines of both input bundles are off, then 
almost all lines of the output bundle will be on. This over-all behavior 
seems to be satisfactory. However, a more careful consideration will 
show that it is not so. Since, for output of Scheffer stroke to be off, both 
inputs have to be on, an error either in the a bundle or in the b bundle 
will be sufficient to cause an error in the output bundle. Therefore, if 
the output is supposed to be off, then the error in output is the sum of 
errors in the input bundles. Similarly, if the output is supposed to be on 
and only one input bundle is off, then the error in output is the same as 
that in the input bundle. If the output is supposed to be on and both 
input bundles are off, then an error in the output bundle requires a 
simultaneous error in both input bundles; and therefore the error level 
in the output bundle is less than the error level in the input bundles. 
Therefore the error level in the inputs is not maintained at the output. 
Some situations result in a magnification of error, while others result in a 
reduction of error. There is then a dispersion of error. This is undesir- 


CONTROL OF ERROR 273 


able, because dispersion of error causes the number of activated lines of 
the output bundle to drift to the undetermined region between ôn lines 
and (1 — 6)n lines, and thus increases the chance of malfunction. 

To suppress the dispersion of error, we can introduce a restoring com- 
ponent of the system as follows. We take each line of the output bundle 
from the executive component of Fig. 18.7, executive in the sense of carry- 
ing out the Scheffer-stroke operation for the over-all system, and split 
it into two lines. We then obtain 2n lines. Then we permutate these 
2n lines so that the order of lines is now made random. ‘Taking the 
successive pairs of lines and using them as inputs to a Scheffer-stroke 
element, we then again obtain a bundle of n output lines. This is 


n lines n lines 


Fie. 18.8 


indicated diagrammatically in Fig. 18.8. Let there be aon activated 
lines in the original bundle. The fraction of the output lines not acti- 
vated is clearly apap = a?. The fraction a: of the output lines activated 
is thus 

ay = l = ap (18.1) 


If a is the probability of having the original lines activated, then, 
provided n is large, the probability of having the transformed lines acti- 
vated is ay. This is not a restoring component yet. But if we put two . 
such units in series, the probability a, of having the final lines activated 
18 

a, = l — a? = 1 — (1 — af)? = 2a} — af (18.2) 


The series system is now a restoring component for the following reason. 
Fig. 18.9 shows the relationship between a: and ap. ae is equal to ao 
when 

as — 2a + a = 0 


or when ay = 0, 4(+/5 — 1), or 1. Thus if ap lies between 0 and 
3(4/5 — 1) = 0.618034 
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æ 18 smaller than ay; if a lies between 2(4/5 — 1) and 1, œ: is greater 
than a. Therefore the action of the restoring component is to drive 
the probability of activating the output towards the limits 0 and 1, and 
thus to reduce the dispersion of error caused by the executive component. 

On the basis of the preceding discussion, then, our system for error 
control consists of an executive component of n individual Scheffer-stroke 
elements, followed by a restoring component composed of two units of 
Fig. 18.8 in series, each made of n Scheffer-stroke elements and a random- 
izing device. Therefore, for each Scheffer-stroke element of perfect 

accuracy, we have to expand the 
ee system to a complex system of 3n 

i Scheffer-stroke elements. With any 

fixed fiduciary level 6 and with given 

probabilities of error in the input 

lines and the Scheffer-stroke ele- 

ments, we shall see that we can 

make the reliability of the over-all 

system as high as we please by in- 

creasing n. Basically then, the 

principle is still reliability by duplica- 

£ v5-1 l @ tion. However, our analysis in this 

ear section gives us a specific plan of 

organizing these elements. This 

particular method of synthesizing a reliable system out of unreliable 
elements is called the method of multiplexing by von Neumann. 

18.4 Error in Executive Component. We shall now compute the 
error of the multiplexed Scheffer-stroke system described in the preceding 
section and show that the error is indeed controlled. We observe first 
that the direct source of error is the individual Scheffer-stroke element 
itself, in either the executive component or in the restoring component. 
Let the probability for each individual element of making a mistake be «. 
If we assume that there are r parallel Scheffer-stroke elements and that 
they are independent in their operation, then the probability for the 
elements to fail when in the system is still e. Therefore, of the r parallel 
elements, on the average er will make mistakes.’ This number is also 
the most likely number of mistakes in the n parallel elements. The 
probability of making some other number of mistakes is less. In fact, 
the problem of determining the probability po(o,¢,7) of having p mistakes 
in r parallel elements with an individual probability of failure equal to is 
the classical problem of random sampling. For large r, it is known? that 


1 See for instance H. Margenau and G. M. Murphy, “The Mathematics of Physics 
and Chemistry,” p. 422, D. Van Nostrand Company, Ine., New York, 1943. 
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U- — er)? 
—é)r 


olp, Er) = a e pee (18.3) 
Therefore the probability distribution po(p,¢,r) is a normal distribution 
with a mean at er and a mean deviation equal to ~/e(1 — e)r. 

The other source of error in the executive component is the misfitting 
of lines from the input bundles into the individual Scheffer-stroke ele- 
ment. For instance, if a fraction & of the a input bundle of Fig. 18.7 
is activated and if a fraction n of the b input bundle is activated, we 
would expect a similar fraction of the output bundle to be inhibited. 
But if the particular a; for ith element is activated and b; happens to be 
nonactivated, the output of the ith element is still activated, even with 
no mistake on the part of the element itself. Let ¢ be the fraction of the 
output bundle of the executive component that is activated. Then the 
nunaber of output lines effectively inhibited is (1 — ¢)n. The total num- 
ber of activated lines in the a input bundle is én, in the b input bundle 
it is yn. The number of effective, or properly fitted, a-input lines is, 
however, only (1 — ¢)n. The difference [E — (1 — ¢)]n is ineffective. 
The number of ineffective b-input lines is [n — (1 — ¢)]n. Therefore 
the number of effective output lines is (1 — ¢)n, the number of ineffective 
output lines due to misfitting activated a input is [f — (1 — §)]n, the 
number of ineffective output lines due to misfitting activated b input is 
în — (1 — £)|n, and, finally, the number of ineffective output lines due to 
nonactivated input lines is the remainder 


aber ber Ol pen ea ae rape 


The number of possible effective combinations of such a classification of 
output is thus! 


1 
2 el 


n! 
(E= Oa ea n e ale e 
On the input side, the number of possible combinations of n qanput 
lines, with én ae (1 — £n nonactivated, is 


n! 
(re) [1 — n]! 
The number of possible combinations of n b-input lines, with yn activated, 


(1 — y)n nonactivated, is 
n! 


(nn) (1 — nn]! 
Therefore the probability pı of having a fraction ¢ of the output acti- 
vated with fractions £ and y of the inputs activated and with perfect 
1 See for instance H. Margenau and G. M. Murphy, op. cit., p. 415. 
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operation of the individual Scheffer-stroke elements, is 


pln, jn) 


n! 


_ (=a = 0 =e = 0 = OE = = = 
n! n! 
[nA — an]! in] — na]! (18.4) 
s o ü OEA- nlm -d 
Ce ea 1 Olin ily = ăla ae a ln 


Clearly, for the enumeration to have meaning, none of the four classes 
of output lines discussed in the preceding paragraph can have less than 
zero number. Whenever that happens, the probability drops to zero. 
That is, pı is zero whenever the following conditions are violated, 


ae eal 
ee (eae Oar (18.5) 
ee a 4 
and 2—&-yn-£>0 


We shall now simplify the expression of Eq. (18.4) under the assump- 
tion that n is large. When n is large, the factorials can be approximated 
by their asymptotic value, given by Stirling’s formula 


n! x ~/ Qn erneit (18.6) 


By using Eq. (18.6) we can write the approximate expression for pı as 


l z 
pilé, tn) = / oan a/a gtn (18.7) 
where 
EIERNE. 20 a EEE 
FSG a a a 
and 


0 = (+t -1log (+t -— 1) +n +t- 1) log m +t- 1) 
+ (1 — ¢) log (1 — t) + (2 — £ — n — ț) log (2 — £~ 7- ċ¢) 
— log — (1 — &) log (1 — £) — n logn — (1 — n) log (1 — „) (18.9) 


Differentiating 0 with respect to ¢, we have 


E ES) 
w -oe n) ety) 


and 
026 ji 1 i Í 


oo bee a aaa a (18.11) 


ey S 
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We find by using these equations that at ¢ = 1 — e = a = 
Furthermore, because of the conditions of Eq. (18. J == 18 always 


x 
positive. Therefore the only zero of 8 is at ¢ = 1 — n. Then if n is 
very large, the negative exponential in Eq. (18.7) shows that we need 
only consider ô near its zero. But at the zero of 0, where ¢ = 1 — én, 
074 ji l 1 
ee ae tee a ee e Spe ee 
7 tag tet Sed Se 7 


Thus near ¢ = 1 — &, 6 is approximated by 

| Goa 0 ces ply 

0 ~ = 18.12 

2 EL Bal — 9) m 
When n is large, a is a relatively slowly varying function of ¢ in com- 
parison to the exponential of Eq. (18.7). Thus we can take the value 
of a at the point ¢ = 1 — &. Or 
n E 
E Cl eq) 
Therefore, finally, the approximate expression for pi(&,¢;n) with very 
large n is 


a~ (18.13) 


/ ) i ee ae 

pilm, tn) = — l E 18.14 
V2r&(1 — &)n(1 — n)n 

Hence 91 is also a normal distribution with respect to ¢. The mean is at 

(1 — &)n, and the mean deviation is ~/&(1 — &)n(1 — n)n. 

We shall make a final modification of the probability p,(&1,¢;n) by 
passing to the continuous distribution function W(f;§,n;n) for large 
values of n. If W(¢3£,n:n) dë is the probability of having the number of 
activated output lines from ¢n to fn + 1 = n(¢ + 1/n), then dt = 1/n, 


and this probability is exactly pi(£,7,¢;n). Therefore 
) 1 -5| t— (1 En) Į 
W3E932) = napi = — E VEIL—£)n(1—1)/n 
Vré — El — 9)/n 


(18.15) 


W(t: nin) di in general is then the probability of having the fraction of 
activated output lines between ¢ and ¢ + dt, with specified fractions 
¿and 7 of the input bundles activated. The size of bundles is determined 
by the number of lines n. W is a Gaussian distribution with the mean 


1 — & and the mean derivation ~/é(1 — &)n(1 — n)/n. An equivalent 


way of expressing the result is to write 


p= ~ a) + foe, (18.16) 
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where y denotes a random variable distributed according to the normal 
Gaussian distribution, with the mean zero and the mean deviation equal 
to unity. Then Eq. (18.16) states that ¢ is distributed according to the 
Gaussian distribution with the mean 1 — & and the mean deviation 
VE — &)n(1 — n)/n. Therefore Eqs. (18.15) and (18.16) express the 
same fact. Equation (18.16) is, however, often more convenient to use. 
We can now combine the two sources of errors and add to the ¢ distri- 
bution of Eq. (18.16) the effects of imperfect individual Scheffer-stroke 
elements. We can rewrite Eq. (18.3) in a way similar to Eq. (18.16): 


p= er + vel — ery (18.17) 


Now in our executive component there are two classes of the Scheffer- 
stroke elements. One type, ¢ in number, is supposed to have activated 
output. A mistake will decrease the number of activated lines by one. 
The g mistakes in this type of elements are given by Eq. (18.17) as 


g=en+~ fel — AEn y (18.18) 


The other type, (1 — ¢)n in number, is supposed to be nonactivated. A 
mistake will increase the number of activated lines by one. The g 
mistakes in this type of elements are distributed as 


g=cd(l—Snt+vVel—od(1—dny (18.19) 


q — q is then the additional number of activated output lines due to 
errors of the elements themselves. According to Eqs. (18.18) and (18.19), 
we have 


of q= ME enve — O11 ny Vell —Ofny (18.20) 


The last two terms of the above equation are the difference of two 
normally distributed random variables. We shall see presently that the 
difference is again a normally distributed random variable. 

Consider two normally distributed random variables zı and z. with 
the mean zero, and the mean deviations sı and oz, respectively. Then 


te (18.21) 
£2 = o2Y 


Or, with Wi(z:) denoting the probability distribution function of 2, 
W o(ze) the probability distribution function of 2s, 
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Now if the two random variables are independent of each other, the joint 
probability of having z, in the range zı to z1 + dz; and zs in the range 
29 to Za + AZo is 

Walz) W alga) dz: dz. 


We now introduce new variables zı and za defined by 


Tı = 21 — 29 

Lo = 21 Be 
watt 

2i = AER + T2) 

22 = g(t — Tı) 


The joint probability in the new variables is then 


LW, (aga W, (25%) du, dae 


By integrating this joint probability with respect to z, from — œ to », 
we obtain the probability IW (21) dzi, where W(z1) is the probability 
distribution of zı = zı — z» Thus 


Wa) = 3 | Í Wi (ats) Ws (25%) arte 
Lf? GCE, 


Arai» 


e ~ Qo ae i m E E 
Er A a (1/802) + (1/80?) 


1 v1 2 
- 1 _ eae) 
Vr Ve + 03 


— 2 = Vo + oy (18.22) 


By integrating the joint probability with respect to zı, we can show that 
ata=Vetay (18.23) 


Hence the difference or the sum of two normally distributed and inde- 
pendent random variables is again a normally distributed random 
variable with the square of the mean deviation equal to the sum of the 
squares of the mean deviations of the original random variables. This 
ability to add or subtract normally distributed random variables with 
zero mean values is to be expected, since they have equal probabilities 
for positive or negative values. 

With the relation of Eq. (18.22), we can write the result of Eq. (18.20) 


aS 
g —q=Rey—Hn+vel —eny 


Therefore we can write 
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Let (g — q)/n = Ag be the correction of the fraction ¢ for imperfect 
Scheffer-stroke elements; then we have 


at = 2-H + f= 2y (18.24) 


We can now combine Eqs. (18.16) and (18.24), and the corrected 
fraction ¢’ of activated output lines is 


i 
repta ertag t Ey 


alesse E E E A een, 


N 
4 (2 y (18.25) 


The last two terms of Eq. (18.25) represent the sum of two independent, 
normally distributed random variables. We can thus use Eq. (18.28). 
Therefore, finally, writing ¢ in place of ¢’, we have from Eq. (18.24) 


č = (1 — n) + 2(& — 3) 
4 q — 2e) E(1 — é)n(1 — 4) + ell — ©) y (18.26) 


n 


where y is a normally distributed random variable with the mean equal 
to zero and the mean deviation equal to unity. Equation (18.26) speci- 
fies the performance of the executive component of our multiplexed 
Scheffer-stroke system with the fractions £, », and ¢ of the inputs and 
output activated and with ¢ as the probability of failure of the individual 
Scheffer-stroke elements. 

18.5 Error of Multiplexed Systems. After having calculated the per- 
formance of the executive component of our multiplexed Scheffer-stroke 
system, we find that the rest of the computation is very easy. Each 
unit (Fig. 18.8) of the restoring component is really equivalent to the 
executive component. For the first stage of the restoring component, the 
inputs are the split output lines from the executive component. Thus 
instead of two different fractions £ and y, we have the same fraction ¢. 
Therefore, if u is the fraction of activated output of the first stage, then, 
according to Eq. (18.26), 

242 
w=) tade p H EEU EA gan 
Similarly if v is the fraction of activated output of the second stage of 
the restoring component, then 


y= (1 — u?) + 2e (u? — 4) + ee ee ead _ eu) y (18.28) 
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Equations (18.27) and (18.28) have a first term identical with Eq. 
(18.1). The additional terms come from the imperfect elements and 
from the statistical distribution of errors. 

With any specified £, n, e and n, Eqs. (18.26) to (18.28) enable us to 
compute the distribution function of y, the fraction of activated output 
lines of the complete Scheffer-stroke system. We can make this some- 
what clearer by reverting to the notation of probability distribution 
functions. Thus, for instance, Eq. (18.26) is equivalent to 


j (Sy) e i 
exp 1-5 ae — 26)E(1 — Sn(l — 9) + (1 — ©) 


TEPEE 


n 


Wliitimn) = 


The probability distribution function of v, W (v; m;n), is thus the result 
of integrating with respect to ¢ and u the joint probability of ¢, u, and v. 
Thus 


O N. EES E. TEE 
Woks) = ap [C= B= nF | o 


n 


(e) 


n n 


í ¢— {1 — &) + Zelën — 4)} ; 
exp l- T — 26)?&(1 — E)y(1 — n) + e(l — ©) 


: i 
| [P A 


n 
p— 1i HA- P 
Ri — w) (1 — §? + e(l — 6) 


n 


1 


2 


s~ (=) tel- T 
— J — 26) u (l — u)? + €(1 — j | (18.29) 
n 

We shall now show that under proper conditions we can obtain almost 
perfect performance of the multiplexed Scheffer-stroke system by increas- 
ing m. Consider a given fiduciary level 6. Perfect performance requires 
that v < ô for the nonactivation of output is implied by £ > 1 — 6 and 
n > 1 — Sfor the activation of both inputs; and that y > 1 — 6 by either 
< dandy > 1— d0ré>1—éandy <ô. Let usassumethatn is so 
large and e so small that terms of order « and 1/+/n can be neglected in 
Eqs. (18.26) to (18.28). Then , 


tal-i) pxl- væl- 


282 ENGINEERING CYBERNETICS 


Or 
v = 1 — (én ~ £7)? = =whenn> 1 (18.30) 
eX] 


Nowlet &=1—a,7=1-—8,anda,6 < 6;so that £ > 1 — ôand n > 1— ô. 
Then Eq. (18.30) gives 


y = Ua t BA Hee 


Hence v = 0(8). Similarly, Eq. (18.30) gives v = 1 — 0(6*) for & <ô 
and n> 1— ô, or £> 1— and 7 <ô. Furthermore, Eq. (18.30) 
also gives y = 1 — 0(84)foré < dandy < ô. Therefore perfect reliability 
of the multiplexed Scheffer-stroke system can be indeed obtained with 
n — ©, provided e and ô are small. 

When n is large but not infinite, the calculation is somewhat tedi- 
ous because of the necessity of evaluating the integral of Eq. (18.29). 
Although the asymptotic values of the integral can be determined by the 
classical methods, we shall not enter into this calculation here. Instead 
we shall cite an example from von Neumann where 6 = 0.07, 1.¢., activa- 
tion of at least 93 per cent of the lines of a bundle represents a positive 
message; activation of at most 7 per cent of the lines represents a negative 
message. He then found that the probability e of malfunction of the 
individual Scheffer-stroke elements must be less than 0.0107 for con- 
trolling the error. Fore > 0.0107, the probability of malfunction of the 
over-all system cannot be made arbitrarily small by increasing n. For 
e = 0.005, or 4 per cent chance of failure, von Neumann gave the numer- 
ical results in Table 18.1. It is seen that for as many as 1000 lines in a 
bundle, the reliability is rather poor. In fact it is inferior to the original 
1 per cent of «. But a 25-fold increase in n will give extreme reliability. 

For systems organized originally in Scheffer-stroke elements, the tech- 
nique of multiplexing discussed in the preceding section can also be 
applied without change. We replace each Scheffer-stroke element in 

TABLE 18.1 


ô = 0.07, e = 0.005 
Number of lines, n Probability of malfunction 


1,000 2.7 X 1072 
2,000 2.6 X 1073 
3,000 2.5 X 10-4 
5,000 4 X 1078 
10,000 1.6 X 1078 
20,000 2.8 X 107” 
25 ,000 1.2 X 1072 


the original system by the Scheffer-stroke system of 3n elements, each 
with its executive component and restoring component. The error in 
the over-all system can be computed from the error of the individual 
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Scheffer-stroke systems asshown in the previous discussions. Practically, 
this is a very tedious calculation. However, for the purpose of estimat- 
ing the order of magnitude of n required for specified reliability, we can 
consider the entire system to be equivalent to a single Scheffer-stroke 
element, and use the result of this over-all reaction directly. This will 
be done in the following section. 

18.6 Examples. To obtain an idea of the magnitude of required bun- 
dle size, let us consider a computing machine of 2,500 vacuum tubes. 
Each of the tubes is supposed to be actuated on the average of once every 
5 microseconds. We specify that the machine should on the average 
run 8 hours before making a single mistake. During this period, the 
number of actuations of a single tube is 


+ X 8 X 3,600 X 10° = 5.76 X 10° 


Consider each tube as a Scheffer-stroke element. Then the specified 
probability of malfunction, considering each tube as an independent unit, 
is 1/(5.76 X 10°). However, there are 2,500 interconnected tubes in 
the system. A mistake in any one among the 2,500 tubes will mean a 
mistake of the machine. Therefore, considering each tube as one unit 
within the system, the specified probability of malfunction should be 
only 1/2,500 of the above value, or 1/(2,500 X 5.76 X 10)? = 7 X 10-“. 
We see then that the final probability of malfunction is the same as that 
obtained by considering the whole system of 2,500 tubes as a single 
Scheffer-stroke element. This possibility greatly simplifies the calcula- 
tion of required number of lines n in the multiplexed system. 

If we assume that the fiduciary level ô and the probability e of failure 
of the tubes are the same as specified in Table 18.1, then the specified 
probability of malfunction obtained above will require n = 14,000, 
according to the table. Therefore, in order to make the machine as 
reliable as specified, it will be necessary to multiplex the system 14,000 
times. This would mean the replacement of every single tube in the 
machine by a system of 3n = 3 X 14,000 = 42,000 tubes. The original 
machine of 2,500 tubes now becomes a giant of 105,000,000 tubes. This 
is clearly not practicable. 

Now take a second example, a plausible quantitative picture of the 
organization of the human nervous system. The number of neurons 
involved is usually given as 10%. But considering the presence of 
synaptic end bulbs and other possible autonomous subunits, this number 
is certainly too low. It ought to be a few hundred times larger. Let 
us take the number of basic elements to be 10%. The neurons can be 
actuated up to a maximum of about 200 times a second. But the average 
rate of actuations must be a good deal less; say 10 actuations per second. 
We shall further assume that a mistake in our nervous system is serious 
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and should not happen in the time interval of a human life span. Take 
the error-free interval to be 10,000 years. During this interval, the 
total number of actuations in the system of 10! elements is 


1018 X 10,000 X 31,536,000 X 10 = 3.2 X 10” 
Thus the probability of malfunction should be 
1/(8.2 X 1075) = 3.2 X 107: 


Again, assume that the basic nervous elements have the properties speci- 
fied in Table 18.1; then an extrapolation from that table gives n = 28,000. 

However, our calculation needs a correction: if the human nervous 
system is indeed multiplexed 28,000 times, the number of basic elements 
in the nonmultiplexed system is not 10" as assumed above; the number 
should be reduced by a factor of 1/(3 X 28,000). Then the probability 
of malfunction should be increased by the factor 3 X 28,000. The cor- 
rected probability of malfunction is now 2.7 X 107%. Table 18.1 
then gives n = 22,000. Further iteration will not change this value 
appreciably. 

These examples show that, while our method of controlling the error 
by multiplexing is quite conceivably applicable to the microcomponents 
of nervous systems, it is nevertheless impractical for engineering control 
systems with the present technologies. One obvious direction of future 
development is the reduction in bulk and power requirement of the 
elements. The transistor is a great improvement over the vacuum tube 
from this point of view. Hence the method of multiplexing may yet 
become practical in the future. Another direction of investigation 
would be a deeper analysis of the process of controlling error. The organi- 
zation of the executive component and the restoring component of our 
basic system for the Scheffer stroke is, after all, only one possible organi- 
zation. We are fortunate in that such a crude attempt already is 
successful in demonstrating the possibility of increasing the reliability. 
There are probably other plans of organization of the duplicated com- 
ponents which will produce the same degree of reliability with a smaller 
number of components. In other words, only a beginning has been made 
in the technique of error control in automatic systems. For control 
engineers, there is as yet no applicable solution to the problem. 
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